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TITLE OP THE INVENTION 

CELL ABLATION USING 
TJ?ANS-SPLICING RIBOZYMES 

Field of the Invention 

The present invention is directed to novel trans- 
splicing ribozymes and methods of cell ablation using 
these ribozymes. 

BRIEF DESCRIPTION OF THE BACKGROUND ART 

I. Group I Introns 

RNA molecules with catalytic activity are called 
ribozymes or RNA enzymes (Cech, T.R., Ann. Rev. 
Biochem. 59:543-568 (1990). The Tetrahymena thermophila 
precursor rRNA contains an intron (a ribozyme) capable 
of catalyzing its own excision. This ribozyme is one 
of a class of structurally related Group I introns. 

The splicing activity of the modified r. 
thermophila intron requires the presence of a guanosine 
cof actor and a divalent cation, either Mg ++ or Mn , 
and occurs via two sequential transesterif ication 
reactions (Figure 1). First, a free guanosine is 
bound to the ribozyme and its 3' hydroxyl group is 
positioned to attack the phosphorus atom at the 5' 
splice site. The guanosine is covalently attached to 
the intron sequence and the 5' exon is released. 
Second, the phosphodiester bond located at the 3' 
splice site undergoes attack from the newly freed 3' 
hydroxyl group of the 5' exon, resulting in production 
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of the ligated exon sequences. The excised intron 
subsequently undergoes a series of transesterif ication 
reactions, involving its 3' hydroxyl group and internal 
sequences, resulting in the formation of shortened 

circular forms. 

These successive reactions are chemically similar 
and appear to occur at a single active site. The 
reactions of self-splicing are characterized by the 
formation of alternative RNA structures as differing 
RNA chains are each brought to form similar 
conformations around the highly conserved intron. 
Splicing requires the alignment of the intron-exon 
junctions across a complementary sequence termed the 
"internal guide sequence" or IGS. 

The first cleavage at the 5' splice site requires 
the formation of a base-paired helix (PI) between the 
IGS and sequences adjacent the splice site. The 
presence of a U:G "wobble" base-pair within this helix 
defines the phosphodiester bond that will be broken in 
the catalytic reaction of the ribozyme. After cleavage 
of this bond, a portion the PI helix is displaced and 
a new helix, P10, is formed due to complementarity 
between the IGS and sequences adjacent the 3" -splice 
site. An invariant guanosine residue precedes the 
phosphodiester at the 3' splice site, similar to the 
portion of the PI sequence that it is displacing. 
Thus, ligation of the exons occurs in a reverse of the 
first cleavage reaction but where new exon sequences 
have been substituted for those of the intron. It may 
be noted that intron circularization reactions 
subsequent to exon ligation also involve base-pairing 
of 5« sequences across the IGS, and attack mediated by 
the 3' hydroxyl group of the intron «s terminal guanine 
residue (Been, M.D. at al . , "Selection Of 
Circularizaton Sites In A Group I IVS RNA Requires 
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Multiple Alignments Of An Internal Template-Like 
Sequence," Cell 50:951 (1987)). 

II. Catalytic Activities 

In order to better define the structural and 
catalytic properties of the Group I introns, exon 
sequences have been stripped from the "core" of the T. 
thermophila intron. Cech, T.R. et al., WO 88/04300, 
describes at least three catalytic activities possessed 
by the Tetrahymena intron ribozyme: (1) a 

dephosphorylating activity, capable of removing the 3' 
terminal phosphate of RNA in a sequence-specific 
manner, (2) an RNA polymerase activity (nucleotidyl 
transferase) , capable of catalyzing the conversion of 
oligoribonucleotides to polyribonucleotides, and (3) a 
sequence-specific endoribonuclease activity. 

Isolated ribozyme activities can interact with 
substrate RNAs in trans, and these interactions 
characterized. For example, when truncated forms of the 
intron are incubated with sequences corresponding to 
the 5' splice junction, the site undergoes guanosien- 
dependent cleavage in mimicry of the first step in 
splicing. The substrate and endoribonucleolytic intron 
RNAs base-pair to form helix PI, and cleavage occurs 
after a U:G base-pair at the 4th-6th position. 
Phylogenetic comparisons and mutational analyses- 
indicate that the nature of the sequences immediately 
adjacent the conserved uracil residue at the 5' splice 
site are unimportant for catalysis, provided the base- 
pairing of helix PI is maintained (Doudna, J. A. et al., 
Proc. Natl. Acad. Sci. USA 86: 7402-7406 (1989)). 

The sequence requirements for 3' splice-site 
selection appear to lie mainly within the structure of 
the intron itself, including helix P9.0 and the 
following guanosine residue which delineates the 3' 
intron boundary. However, flanking sequences within the 
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3 • exon are required for the formation of helix P10 and 
efficient splicing, as shown by mutational analysis 
(Suh, E.R. et al., Mol. Cell. Biol. 20:2960-2965 
(1990)}. In addition, oligonucleotides have been 
ligated in trans, using a truncated form of the intron, 
and ••external" guide sequence and oligonucleotides 
which had been extended by a 5* guanosine residue. The 
substrate oligonucleotides corresponding to 3- exon 
sequences were aligned solely by the formation of P10- 
like helices on an external template, prior to ligation 
(Doudna, J.A. et al., Nature 335:519-522 (1989)). 

The cleavage activity of ribozymes has been 
targeted to specific RNAs by engineering a discrete 
"hybridization" region into the ribozyme, such 
hybridization region being capable of specifically 
hybridizing with the desired RNA. For example, 
Gerlach, W.L. et al., EP 321,201, constructed a 
ribozyme containing a sequence complementary to a 
target RNA. Increasing the length of this 

complementary sequence increased the affinity of this 
sequence for the target. However, the hybridizing and 
cleavage regions of this ribozyme were integral parts 
of each other. Upon hybridizing to the target RNA 
through the complementary regions, the catalytic region 
of the ribozyme cleaved the target. It was suggested 
that the ribozyme would be useful for the inactivation 
or cleavage of target RNA in vivo, such as for the 
treatment of human diseases characterized by the 
production of a foreign host's RNA. However, ribozyme- 
directed trans-splicing, (as opposed to trans-cleivage) 
was not described or suggested. 

The endoribonuclease activities (the cleavage 
activities) of various naturally-occurring ribozymes 
have been extensively studied. Analysis of the 
structure and sequence of these ribozymes has indicated 
that certain nucleotides around the cleavage site are 
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highly conserved but flanking sequences are not so 
conserved. This information has lead to the design of 
novel endoribonuclease activities not found in nature. 
For example, Cech and others have constructed novel 
ribozymes with altered substrate sequence specificity 
(Cech, T.R. et al., WO 88/04300; Koizumi, M. et al., 
FEBS Lett. 228:228-230 (1988); Koizumi, M. et al., FEBS 
Lett. 239:285-288 (1988); Haseloff, J. et al., Nature 
334:585-591 (1987); and Heus, H.A. et al., Nucl. Acids 
Fes. 28:1103-1108 (1990)). From early studies of the 
self-cleaving plant viroids and satellite RNAs 
(Buz ay an, J.M. et al., Proc. Natl. Acad. Sci. VSA 
83:8859-8862 (1986), guidelines for the design of 
ribozymes that are capable of cleaving other RNA 
molecules in trans in a highly sequence specific have 
been developed (Haseloff, J. et al., Nature 334:585-591 
(1988)). However, these constructs were unable to 
catalyze efficient, targeted trans-splicing reactions. 

The joining of exons contained on separate RNAs, 
that is, trans-splicing, occurs in nature for both 
snRNP-mediated and self-catalyzed group I and group II 
introns. In trypanosoroe and Caenorhabditis eleqans 
mRNAs, common 5* leader sequences are transcribed from 
separate genes and spliced to the 3' portions of the 
mRNAs (Agabian, N. , Cell 61:1157-1160 (1990); Hirsh, D. 
et al., Mol. Biol. Rep. 14:115 (1990). These small 
"spliced leader" RNAs (slRNAs) consist of the 5' exon 
fused to sequences that can functionally substitute for 
Ul snRNA in mammalian snRNP-spl icing extracts. 

Also, both the group I and group II self-splicing 
introns are capable of exon ligation in trans in 
artificial systems (Been, M.D. et al., Cell 47:207-216 
(1986); Galloway-Salvo, J.L. et al. f J. Mol. Biol. 
211:537-549 (1990); Jacquier, A. et al . , Science 
234:1099-1194 (1986); and Jarrell, K.A. et al., Mol. 
Cell Biol. 8:2361-2366 (1988)). Trans- splicing occurs 
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in vivo for group II introns in split genes of 
chloroplasts (Kohchi, T. at al., Nucl. Acids Res. 
26:10025-10036 (1988)), and has been shown for a group 
I intron in an artificially split gene in Escherichia 
coli (Galloway-Salvo, J.L. et al., J. Mol. Biol. 
211:537-549 (1990)). m the latter case, a 
bacteriophage T4 thymidilate synthase gene (td) 
containing a group I intron was divided at the loop 
connecting the intron helix P6a. Transcripts of the td 
gene segments were shown to undergo trans-splicing in 
vitro, and to rescue dysfunctional B. coli host cells. 
Known base-pairings (P3, P6 and P6a) and possible 
tertiary interactions between the intron segments, 
allowed correct assembly and processing of the gene 
halves. 

In vitro, the Tetrahymena ribozyme is capable of 
catalyzing the trans-splicing of single-stranded model 
oligoribonucleotide substrates. Four components were 
necessary: ribozyme, 3- single-stranded RNA, 5- exon 
and GTP. A shortened form of the Tetrahymena ribozyme 
(L-21 Seal IVS RNA), starting at the internal guide 
sequence and terminating at U 409 has been used in such 
a reaction (Flanegan, J.B. et al. f J- Cell. Biochem. 
(Su PP .)12 part D:2B (1988)). Attack by GTP at the 5' 
splice site released the 5' exon which was then ligated 
by the ribozyme to the 3 • exon in a transesterif ication 
reaction at the 3' splice site. 

The in vivo use of ribozymes as an alternative to 
the use of antisense RNA for the targeting and 
destruction of specific RNAs has been proposed 
(Gerlach, W.L. et al., EP321.201; Gotten, M. , Trends 
Biotechnol. 8:174-178 (1990); Cotten, M. et al., EMBO 
J. 8:3861-3866 (1989); Sarver, N. et al . , Science 
247:1222-1225 (1990)). For example, expression of a 
ribozyme with catalytic endonucleolytic activity 
towards an RNA expressed during HIV-1 infection has 
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been suggested as a potential therapy against human 
immunodeficiency virus type 1 (HIV-1) infection 
(Sarver, N. et al.. Science 247:1222-1225 (1990); 
Cooper, M. , CDC AIDS Weekly, April 3, 1989, page 2; 
Rossi, J.J., Abstract of Grant No. 1R01AI29329 in 
Dialog's Federal Research in Progress File 265). 
However, such attempts have not yet been successful. 

In a study designed to investigate the potential 
use of ribozymes as therapeutic agents in the treatment 
of human immunodeficiency virus type 1 (HIV-1) 
infection, ribozymes of the hammerhead motif (Hutchins, 
C.J. et al., Nucl. Acids Res. 14:3627 (1986); Keese, P. 
et al., in Viroids and Viroid-Like Pathogens, J.S. 
Semancik, ed. , CRC Press, Boca Raton, FL, 1987, pp. 1- 
47) were targeted to the HIV-1 gag transcripts. 
Expression of the gag-targeted ribozyme in human cell 
cultures resulted in a decrease (but not a complete 
disappearance of) the level of HIV-1 gag RHA and in 
antigen p24 levels (Sarver, N. et al.. Science 
247:1222-1225 (1990)). Thus, the medical effectiveness 
of Sarver' s ribozyme was limited by its low efficiency 
since any of the pathogen's RNA that escapes remains a 
problem for the host. 

Another problem with in vivo ribozyme applications 
is that a high ribozyme to substrate ratio is required 
for ribozyme inhibitory function in nuclear extracts 
and it has been difficult to achieve such ratios. 
Cotton et al. achieved a high ribozyme to substrate 
ration by microinjection of an expression cassette 
containing a ribozyme-producing gene operably linked to 
a strong tRNA promoter (a polymerase III promoter) in 
frog oocytes, together with substrate RNA that contains 
the cleavage sequence for the ribozyme (Cotton, M. et 
al., EMBO J. 8:3861-3866 (1989).- However, 
microinjection is not an appropriate method of delivery 
in multicellular organisms. 
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The in vivo activity of ribozymes designed against 
mRNA coding for Escherichia coli 0-galactosidase has 
been reported (Chuat, J.-C. et al., Biochem. Biophys. 
R es. commun. 152:1025-1029 (1989)). However, this 
activity was only observed when the ribozyine and target 
were transfected into bacterial cells on the same 
molecule. Ribozyine activity was inefficient when 
targeted against an mRNA transcribed from a bacterial 
F episome that possessed the target part of the 0- 

galactosidase gene. 

Thus, current technological applications of 
ribozyine activities are limited to those which propose 
to utilize a ribozyme's cleavage activity to destroy 
the activity of a target RNA. Unfortunately, such 
applications often require complete destruction of all 
target RNA molecules, and/or relatively high 
ribozyme: substrate ratios to ensure effectiveness and 
this has been difficult to achieve. Most importantly, 
the modified ribozymes of the art are not capable of 
efficient, directed trans-splicing. 

Accordingly, a need exists for the development of 
highly efficient ribozymes and ribozyme expression 
systems. Especially, the art does not describe an 
effective means in which to destroy anexisting RNA 
sequence or to alter the coding sequence of an existing 
RNA by the trans-splicing of a new RNA sequence into a 
host's RNA. 

gTTMM&RY OF THF. TNVENTION 

Recognizing the potential for the design of novel 
ribozymes, and cognizant of the need for highly 
efficient methods to alter the genetic characteristics 
of higher eukaryotes in vivo, the inventors have 
investigated the use of ribozymes to alter the genetic 
information of native RNA 1 s in vivo. These efforts 
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have culminated in the development of highly effective 
trans-splicing ribozymes, and guidelines for the 
engineering thereof. 

According to the invention, there is first 
provided an RNA or DNA molecule, such molecule encoding 
a trans-splicing ribozyroe, such ribozyme being capable 
of efficiently splicing a new 3 1 exon sequence into any 
chosen target RNA sequence in a highly precise manner, 
in vitro or in vivo, and such molecule being novel in 
the ability to accomodate, any chosen target RNA or 3» 
exon sequences, and in the addition of a complementary 
sequence which enhances the specificity of such 
ribozyme. 

According to the invention, there is also provided 
an RNA or DNA molecule, such molecule encoding a 
ribozyme, the sequence for such ribozyme being a fusion 
RNA, such fusion RNA providing a first RNA sequence 
that is sufficient for targeting such ribozyme to 
hybridize to a target RNA, and further a second RNA 
sequence, such second RNA sequence capable of being 
transposed into the target RNA, and such second RNA 
sequence encoding an RNA sequence foreign to the 
targeted RNA sequence. 

According to the invention, there is further 
provided an RNA or DNA molecule, such molecule encoding 
a ribozyme, the sequence for such ribozyme being a 
fusion RNA as described above, the first RNA sequence 
provided by the fusion RNA being a sequence for 
targeting such RNA molecule to hybridize to GAL4 RNA, 
and the second RNA sequence of the fusion RNA providing 
the coding sequence of the A chain of diphtheria toxin 
(DTA) . 

According to the invention, there is also provided 
an RNA or DNA molecule, such molecule encoding a 
conformationally disrupted ribozyme of the invention, 
a pro-ribozyme, such pro-ribozyme being substrate- 
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activated, that is, such pro-ribozyme possessing 
neglible or no self-cleavage or trans-splicing 
activity, until being reactived by specific interaction 

with target RNA. 

According to the invention, there is further 
provided an RNA or DNA molecule containing a ribozyme 
or pro-ribozyme expression cassette, such cassette 
being capable of being stably maintained in a host, or 
inserted into the genome of a host, and such cassette 
providing the sequence of a promoter capable of func- 
tioning in such host, operably linked to the sequence 
of a ribozyme or pro-ribozyme of the invention. 

According to the invention, there is further 
provided an RNA or DNA molecule containing a ribozyme 
or pro-ribozyme expression cassette, such cassette 
being capable of being stably inserted into the genome 
of a host, such ribozyme expression cassette providing 
the sequence of a GAL4 -responsive promoter operably 
linked to the sequence of a ribozyme or pro-ribozyme of 

the invention. 

According to the invention, there is further 
provided a method for in-vitro trans-splicing, such 
method comprising the steps of (1) providing a ribozyme 
or pro-ribozyme of the invention and an appropriate 
substrate for such ribozyme in vitro, (2) further 
providing in vitro reaction conditions that promote the 
desired catalytic activity of such ribozyme or pro- 
ribozyme; and (3) allowing such ribozyme or pro- 
ribozyme to react with such substrate under such 
conditions. 

According to the invention, there is further 
provided a method for in vivo trans-splicing, such 
method comprising the steps of (1) providing an RNA or 
DNA molecule of the invention to a host cell, (2) 
expressing the ribozyme or pro-ribozyme encoded by such 
molecule in such host cell, (3) expressing a substrate 
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of such ribozyme or pro-ribozyroe in such host cell, and 
(4) allowing such ribozyme or pro-ribozyroe to react 
with such substrate in such host cell. 

According to the invention, there is further 
5 provided a method for inactivating the activity of a 

target RNA, such method comprising (1) providing a 
ribozyme or pro-ribozyme of the invention, such 
ribozyme or pro-ribozyme being catalytically active 
against such target RNA, (2) providing such target RNA, 
10 and (3) providing conditions that allow such ribozyme 

or pro-ribozyme to express its catalytic activity 
towards such target RNA. 

According to the invention, there is further 
provided a method for providing a desired genetic 
15 sequence to a host cell in vivo, such method comprising 

(1) providing a ribozyme or pro-ribozyme of the 
invention to a desired host cell, such ribozyme or pro- 
ribozyme being catalytically active against a target 
RNA in such host cell, (2) providing such ribozyme or 
20 pro-ribozyme encoding such desired genetic sequence, 

and (3) providing conditions that allow such ribozyme 
or pro-ribozyme to trans-splice such desired genetic 
sequence into the sequence of the target RNA. 

According to the invention, there is further 
25 provided a method for cell ablation in multicellular 

plants and animals, such method comprising providing a 
ribozyme or pro-ribozyme of the invention to a any host 
cell, and especially into a fertilized embryonic host 
cell, such ribozyme or pro-ribozyme encoding the 
30 sequence of a gene toxic to such host cell and such 

ribozyme or pro-ribozyme being capable of trans- 
splicing with a desired target in such host cell. 

According to the invention, there is further 
provided a method for engineering male or female 
35 sterility in agronomically important plant species, 

such method comprising the ablation of any cell 
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necessary for fertility using a ribozyme or pro- 
ribozyme of the invention. 

According to the invention, there is further 
provided a method of immunizing plants against plant 
pathogens, such method comprising the construction of 
transgenic plants capable of expressing a plant 
pathogen-specific fusion ribozyme or pro-ribozyme of 
the invention, and such ribozyme or pro-ribozyme being 
capable of ablating any host cell infected with such 
pathogen . 

According to the invention, there is further 
provided a transformed, pathogen-resistant 
microorganism, such microorganism being resistant to a 
desired pathogen, such microorganism being transformed 
with a ribozyme or pro-ribozyme of the invention and 
such ribozyme or pro-ribozyme providing a catalytic 
activity that targets a nucleic acid molecule expressed 

by such pathogen. 

According to the invention, there is further 
provided a viral pathogen capable of delivering a 
desired ribozyme or pro-ribozyme activity to a desired 
host, such ribozyme or pro-ribozyme activity being 
delivered by a ribozyme or pro-ribozyme of the 
invention. 

nggraTPTION OF THE F IGURES 

Figure 1 is a diagram of the mechanism of ribozyme 
splicing of the group I intron. 

Figure 2 is a diagram of structure of the (A) 
Tetrahymena thermophila rRNA intron; (B) Target mRNA 
and trans-splicing ribozyme or pro-ribozyme of the 
invention. 

Figure 3(A) is a diagram of the design of a CAT- 
LacZ a-peptide trans-splicing ribozyme; (B) is the 
complete DNA coding sequence of the CAT-LacZ ribozyme. 



Figure 4 presents the sequences of cucumber mosaic 
virus (CMV) RNA 4 trans-splicing ribozymes. A: virus 
RNA target sequences; B: Oligonucleotide target 
sequences; C: CMV RNA4 - diphtheria toxin A-chain 
trans-splicing ribozymes. 

Figure 5 is a comparison of cucumber mosaic virus 
3/4 sequences. 

Figure 6(A) is a diagram of the design of a Gal4- 
Diphtheria toxin A (DTA) trans-splicing ribozyme; (B) 
is the complete coding sequence of the Gal 4 -DTA 
ribozyme with the isoleucine substitution. 

Figure 7 is a diagram of the P-element mediated 
"enhancer-trapping" method for expression of Gal4 
protein. 

Figure 8 presents a partial sequence of wild-type 
DTA and DTA 3' exon mutants. 

Figure 9 is a map of pGaTB and pGaTN. 
Figure 10 is a map of pUAST. 

Figure 11 is a cuticle preparation of a Drosophila 
embryos expressing a Gal4-DTA trans-splicing ribozyme. 

Figure 12 presents the rationale for "pro- 
ribozyme" design. Arrows show sites of ribozyme 
cleavage, "antisense" regions are shown in black, 
catalytic domains are shown with radial shading, and 3' 
"exon" sequences are shown with light shading. In the 
absence of the target mRNA, trans-splicing ribozymes 
may transiently base-pair, and react with heterologous 
sequences (including their own) . In addition, scission 
at the "3' exon" junction will occur. Inactive "pro- 
ribozymes" are constructed to contain extra self- 
complementary sequences which cause the catalytic 
center of the ribozyme to be mis-folded. Active 
ribozymes are only formed after base-pairing with the 
intended target mRNA - and consequent displacement of 
the interfering secondary structure. 
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Figure 13 shows the sequence and predicted 
secondary structure of the CAT-LacZ trans-splicing 
ribozyme. Ribozyme "core" sequences are shaded (after 
Cech, Gene 73:259-271 (1988)). Helices P8 are shown 
for the unmodified ribozyme and pro-ribozymes 1 and 2, 
with 13 and 18 nucleotides, respectively, of sequence 
complementary to the "antisense" region (highlighted) . 

Figure 14 shows (1) active CAT-LacZ ribozyme shown 
schematically, with "antisense", ribozyme domain with 
helix P8 and 3' "exon" sequences; (2) (a) inactive CAT- 
LacZ pro-ribozyme 2 shown with base-pairing between 
sequences in the modified helix P8 and the "anti-sense" 
region; and (b) the active pro-ribozyme, after base- 
pairing with the CAT mRNA, displacement of the helix P8 
- "antisense" pairing, and re-formation of helix P8. 

Figure 15 shows stability of CAT-LacZ pro-ribozyme 
transcripts. Plasmids containing the CAT-LacZ ribozyme 
and pro-ribozyme sequences were cleaved with EcdRl and 
transcribed using T7 or SP6 RNA polymerase and [32- 
P)UTP. Radiolabeled transcripts were fractionated by 
5% polyacrylamide gel electrophoresis in 7M urea and 
25% formamide, and autoradiographed. The ribozyme 
transcripts underwent extensive hydrolysis, primarily 
at the "3' exon" junction. The pro-ribozyme forms were 
25 markedly less reactive. 

Figure 16 shows endoribonuclease activity of CAT- 
LacZ pro-ribozymes. Plasmids containing CAT-LacZ 
ribozyme and pro-ribozyme sequences were cleaved, with 
Seal, and transcribed with T7 or SP6 RNA polymerase. 
Transcripts were incubated for 30' at 37»C, 45»C and 
50»C in 40 mM Tris-HCl pH 7.5, 6 mM MgCl 2 , 2 mM 
spermidine, 10 mM NaCl, 2 mM GTP with radiolabeled CAT 
RNA, transcribed using T7 RNA polymerase from plasmid 
cut with PuvII. Products were fractionated by 5% 
polyacrylamide gel electrophoresis in 7M urea and 25% 
formamide, and autoradiographed. RNA mediated cleavage 
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of the 173 nt (nucleotides) CAT RNA produces 5' and 3' 
fragments of 76 nt and 97 nt, respectively. 

Figure 17 shows the "wild-type" and modified 
helices P8 used for pro-ribozyme design with possible 
base-pairs indicated in schematic form. Those bases 
which are complementary to the "anti-sense" portion of 
the corresponding pro-ribozyme , are shown in bold type. 
The number of complementary bases is listed next to 
each helix. The helices are ordered by the stability 
of the corresponding pro-ribozyme transcripts, as 
measured by the degree of "3' exon" hydrolysis during 
in vitro transcription. 

Figure IB shows the stability of GAL4-DTA pro- 
ribozymes. Plasmids containing ribozyme and pro- 
ribozyme sequences were linearized with Xhol and 
transcribed using T7 RNA polymerase. Transcripts were 
incubated for 60' at 50°C n 40 mM Tris-HCl pH 7.5, 6 mM 
MgCl 2 , 2 mM spermidine, 10 mM NaCl, 1 mM GTP, were 
fractionated by 5% polyacrylamide gel electrophoresis 
in 7M urea and 25% formamide, and autoradiographed. 
Ribozyme transcripts are extensively hydrolysed under 
these conditions, while prb-ribozyme 1 is less so and 
pro-ribozyme 2 is stable. 

pETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

In the description that follows, a number of terms 
used in recombinant DNA (rDNA) technology are 
extensively utilized. In order to provide a clear and 
consistent understanding of the specification and 
claims, including the scope to be given such terms, the 
following definitions are provided. 

Ribozvme , An RNA molecule that inherently 
possesses catalytic activity. 
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Trans-splice . A form of genetic manipulation 
whereby a nucleic acid sequence of a first 
polynucleotide is co-linearly linked to or inserted 
into the sequence of a second polynucleotide, in a 
manner that retains the 3'- 5' phosphodiester linkage 
between such polynucleotides. By "directed- trans- 
splicing or "substrate-specific" trans-splicing is 
meant a trans-splicing reaction that requires a 
specific specie of RNA as a substrate for the trans- 
splicing reaction (that is, a specific specie of RNA in 
which to splice the transposed sequence). Directed 
trans-splicing may target more than one RNA specie if 
the ribozyme or pro-ribozyme is designed to be directed 
against a target sequence present in a related set of 
RNAs. 

Target RNA . An RNA molecule that is a substrate 
for the catalytic activity of a ribozyme or pro- 
ribozyme of the invention. 

Ex pression Cassette . A genetic sequence that 
provides sequences necessary for the expression of a 
ribozyme or pro-ribozyme of the invention. 

stably . By "stably" inserting a sequence into a 
genome is intended insertion in a manner that results 
in inheritance of such sequence in copies of such 
genome . 

o pprable linkage. An "operable linkage" is a 
linkage in which a sequence is connected to another 
sequence (or sequences) in such a way as to be capable 
of altering the functioning of the sequence (or 
sequences). For example, by operably linking a 
ribozyme or pro-ribozyme encoding sequence to a 
promoter, expression of the ribozyme or pro-ribozyme 
encoding sequence is placed under the influence or 
control of that promoter. Two nucleic acid sequences 
(such as a ribozyme or pro-ribozyme encoding sequence 
and a promoter region sequence at the 5' end of the 
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encoding sequence) are said to be operably linked if 
induction of promoter function results in the 
transcription of the ribozyme or pro-ribozyme encoding 
sequence and if the nature of the linkage between the 
two sequences does not (1) result in the introduction 
of a frame-shift mutation, (2) interfere with the 
ability of the expression regulatory sequences to 
direct the expression of the ribozyme. Thus, a 
promoter region would be operably linked to a nucleic 
acid sequence if the promoter were capable of effecting 
the synthesis of that nucleic acid sequence. 

II. Engineering of the Ribozvme of the Invention 

15 The trans-splicing ribozymes, pro-ribozymes and 

methods of the invention provide, for the first time, 
a ribozyme capable of directed trans-splicing into any 
RNA sequence, and especially into mature (non-intron- 
containing) mRNA. The trans-splicing ribozyme as 

20 described herein, with its extended complementarity to 

the target, greatly differs from T. thermophila derived 
endoribonuclease activities described in the art. The 
additional complementarity of the ribozymes of the 
invention confers increased affinity and specificity 

25 for the target and the complementarity is not an 

integral part of the catalytic activity. In addition, 
cleavage occurs efficiently and precisely in the 
absence of denaturants and at high concentrations of 
Mg ++ . 

30 The guidelines described herein for the design of 

trans-splicing ribozymes are conservative, based on the 
well characterized properties of group I self-splicing 
introns and are meant to provide a general scheme for 
the design of any directed trans-splicing ribozyme. 

35 Accordingly, the guidelines presented herein are not 

limited to the group I intron of the T. thermophila 
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pre-mRNA and may be used by one of skill in the art to 
design a ribozyroe of the invention with other group I 
introns using such guidelines and knowledge in the art. 

The native T. thermophila ribozyme (the intron 
sequence) is located from base 53 to base 465 in the 
sequence below of the T. thermophila extra chromosomal 
rDNA: 

TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT 
CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT 
CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 
GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 
15 GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT 

GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT 

GGATGCAGTT cacagactaa atgtcggtcg gggaagatgt attcttctca 

TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG 
CAACACTGGA GCCGCTGGGA ACTAATTTGT atgcgaaagt atattgatta 
GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA 
CGCGCATGAA TGGATTA [SEQ ID NO.l] 

30 (Kan, N.C. et al. f Nucl. Acids Res. 20:2809-2822 

(1982)) . 

As described herein, the directed trans-splicing 
ribozymes of the invention are engineered using the 
catalytic core of this intron. The intron, and its 
catalytic core can be isolated by methods known in the 
art. The catalytic core of the intron, that is, the 
truncated intron, differs form the full-length intron 
only in that it is truncated at the Seal site, thus 
removing the last five nucleotides of the intron. The 
truncated intron RNA may be prepared by techniques 
known in the art or may be purchased commercially in 
kit form from commercial sources such as, for example, 
product #72000 from US Biochemical, Cleveland, 
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(RNAzyme™ Tet 1.0 Kit). This US Biochemical kit 
provides ribozyme and the protocol for the use of the 
ribozyme. Transcribed Tet.l cDNA may be used as the 
substrate for polymerase chain reaction (PGR) 
mutagenesis as described below, to produce a synthetic 
trans-splicing enzyme. 

Substrate specificity of the ribozyme of the 
invention, that is, the ability of the ribozyme to 
"target" a specific RNA as a substrate, is conferred by 
fusing complementary sequences specific to the target 
(substrate) RNA to the 5' terminus of the ribozyme. 

Directed trans-splicing specificity of the 
ribozyme of the invention, that is, specificity in 
trans-splicing a desired foreign sequence of interest 
with the sequence of a target RNA, is conferred by 
providing a new 3 1 exon at the 3 1 terminus of the 
ribozyme. Details of the design are further provided 
below. 

To alter the structural and catalytic properties 
of the Group I introns, exon sequences replace the 
flanking sequence of such introns so that only the 
catalytic core of the intron, the ribozyme, remains. 
The resulting modified ribozyme can interact with 
substrate RNAs in trans. When truncated forms of the 
intron (i.e., the catalytic "core," i.e. truncated at 
the Seal site, removing the last five nucleotides of 
the intron) are incubated with sequences corresponding 
to the 5' splice junction of the native ribozyme, the 
site undergoes guanosine-dependent cleavage in mimicry 
of the first step in splicing. 

Engineering of the ribozymes of the invention 
requires consideration of the four guidelines that 
follow. 

First, a splice site must be chosen within the 
target RNA. In the final trans-splicing complex, only 
the 5 1 portion of the PI duplex is contributed by the 



WO 92/13089 



PCT/US92/00277 



- 20 - 



target RNA. Only a single conserved residue, uracil, 
is required immediately 5' of the intended splice site. 
This is the sole sequence requirement in the target 
RNA. There is no inate structure required of the target 
UNA Mature mRNA may be targeted and the trans- 
splicing reaction performed in the cell's cytoplasm 
rather than in the nucleus against pre-mRNA. This 
obviates the need for high concentrations of ribozyme 
in a cell's nucleus. 

Second, having chosen a particular target 
sequence, compensating sequence changes must be added 
to the 5' section of the ribozyme in order to allow the 
formation of a suitable helix PI between the target and 
ribozyme RNAs. It is highly desired is that the helix 
PI should contain a U:G base-pair at the intended 5' 
splice site, and should be positioned at the 4th, 5th 
(preferred) or 6th position from the base of the helix 
(Doudna, J. A., et al., "RNA Structure, Not Sequence 
Determines The 5' Splice-Site Specificity of a Group I 
Intron, » Proc. Natl. Acad. Sci. USA 86:7402-7406 
(1989), incorporated herein by reference). For the 
native T. thermophila intron, PI extends for an 
additional 3 base pairs past the intended 5' splice 
site, and, in a preferred embodiment, this is 
maintained in the trans-splicing ribozyme of the 
invention. For trans-splicing to be efficient, the 
substrate and endoribonucleolytic intron RNAs must 
base-pair to form helix PI, with a resulting wobble U:G 
base-pair. Cleavage of the target RNA occurs at the 
phosphodiester bond immediately 3' to (after the) U:G 
base-pair. Phylogenetic comparisons and mutational 
analyses indicate that the nature of the sequences 
immediately adjacent the conserved uracil residue at 
th e 5- splice site are unimportant for catalysis, 
provided the base-pairing of helix PI is maintained. 
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Third, the exon sequences flanking the 3' splice 
site must be chosen, and adjustments made in the 5' 
section of the ribozyme, if necessary, to allow the 
formation of a stable P10 helix. While the P10 helix 
may be dispensesd with if necessary, its presence 
enhances splicing and preferred embodiments of the 
ribozyme of the invention retain the P10 helix (Suh, 
E.R. et al., "Base Pairing Between The 3' Exon And An 
Internal Guide Sequence Increases 3' Splice Site 
Specificity in the Tetrahymena Self -Splicing rRNA 
Intron," Mol . Cell. Biol. 10:2960-2965 (1990)). The 
helices PI and P10 overlap along the T. thermophila 
intron IG'S, and the 2nd and 3rd residues following both 
the 5' and 3' splice sites are complementary to the 
same residues in the IGS (Figure 2). While there may 
be some advantage in following this, many natural group 
I introns do not share this constraint, so the choice 
of 3' exon sequences may be determined primarily by 
experimental considerations. Such considerations 
reflect the wide flexibility in choice of splice sites. 
For example, if it is desired to join two sequences at 
a given point, the sequence at such point cannot be 
mutated or otherwise altered by the trans-splicing 
event. Either PI or P10 can be made shorter if the 
overlapping sequences don't otherwise accomodate for 
the desired splice site. 

The sequence requirements for 3 • splice-site 
selection appear to lie mainly within the structure of 
the intron (the ribozyme) itself, including helix P9.0 
and the adjoining 3' guanosine residue which delineates 
the 3' intron boundary. P9.0 is wholly contained within 
the intron sequences and helps define the adjacent 3' 
splice site. For the trans-splicing design, the P9.0 
helix and the rest of the functional RNA elements 
within the intron are not altered. The structural 
characteristics of the P9.0 helix are known (Michel, F. 
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et al., "The Guanosine Binding Site of the Tetrahymena 
Ribozyme," Nature 342:391-395 (1989)). However, 
flanking sequences within the 3' exon are required for 
the formation of helix P10 and efficient splicing, as 
shown by mutational analysis. 

Fourth, a region of complementary sequence is 
placed at the 5- terminus of the trans-splicing 
ribozyme in order to increase its affinity and 
specificity for the target RNA. As shown herein, an 
arbitrary length of around 40 residues has been used. 
Other lengths may be used provided they are not 
detrimental to the desired effect. 

For example, starting with the T. thermophila 
self-splicing intron (diagrammed below): 

l 

pl | U A G C A A 

5 " . .CUCUCUAAA U 

I • I I | • | | A 

GGGAGGUUUCCAUUU 

] I I I I I I ... 

' „ GUAAGGUA. • • 3 

ribozyme core e u « « »^ 

2 

(The "1" and "2" in the above diagram (and in other 
ribozyme diagrams throughout the application) note the 
first and second splice sites, respectively.) 

(1) a "5 ,w site is chosen adjacent to a uracil 
residue within a chosen target RNA. The sequences 
involved in complementarity do not immediately abut 
sequences involved in Pl helix formation but are 
separated, for example, by five nucleotides also 
involved in P10 formation; 

(2) sequences complementary to the chosen RNA are 
fused to the 5« portion of the self -splicing Group I 
intron. Base-pairing between ribozyme and target RNA 
allow formation the of the helix Pl; 

(3) the chosen «3« exon" sequences are fused to 
the 3' portion of the ribozyme, maintaining the 
conserved helix P10; and 
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(4) to increase affinity for the target RNA, if 
desired, a section of extended sequence complementarity 
is fused to the 5 1 portion of the ribozyme to allow the 
formation of 30-40 base-pairs. 

The alignment of the resulting trans-splicing 
ribozyme with its target RNA may be diagrammed as shown 
immediately below. The target RNA sequence represents 
the top line. The ribozyme sequence is aligned below 
it, a continuous sequence wrapping around the lower two 
lines wherein the hybridization of the nucleotides at 
the 5* and 3' ends and PI and P10 of the ribozyme may 
be seen. 
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According to the invention, trans-splicing 
ribozymes can be designed that will trans-splice 
essentially any RNA sequence onto any RNA target. It 
is not necessary that the target contain an intron 
sequence or that the ribozyroe be an intron in the 
target sequence. For example, a strategy for such 
design may include (1) the identification of the 
desired target RNA (2) cloning and/or sequencing of the 
desired target RNA or portion thereof (3) selection of 
a desired coding sequence to trans-splice into the 
target RNA, (4) the construction of a ribozyroe of the 
invention capable of hybridizing to such target using 
the guidelines herein and (5) confirmation that the 
ribozyme of the invention will utilize the target as a 
substrate for the specif ic trans-splicing reaction that 
is desired and (6) the insertion of the ribozyme into 
the desired host cell. 

Choice of a target RNA will reflect the desired 
purpose of the trans-splicing reaction. If the purpose 
of the reaction is to inactivate a specific RNA, then 
such RNA roust be trans-spliced at a position that 
destroys all functional peptide domains encoded by such 
RNA and at a position that does not result in continued 
expression of the undesired genetic sequences. If more 
than one allele of the gene encoding such RNA exists, 
the ribozyroe should preferably be designed to 
inactivate the target RNA at a site common to all 
expressed forms. Alternatively, more than one ribozyme 
may be provided to the cell, each designed to 
inactivate a specific allelic form of the target RNA. 

When only inactivation of the target RNA is 
desired, and not the expression of a new, desired RNA 
sequence, it is not necessary that the foreign RNA 
donated by the ribozyroe provide a sequence capable of 
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being translated by the host cell, and a sequence 
containing translational stop codons may be used as a 
truncated intron, for example, the intron ribozyme 
truncated at the Seal site. 

If the purpose of the trans-splicing reaction is 
to provide a genetic trait to a host cell, then the 
choice of target RNA will reflect the desired 
expression pattern of the genetic trait. If it is 
desired that the genetic trait be continuously 
expressed by the host, then the target RNA should also 
to be continuously expressed. If it is desired that 
the genetic trait be selectively expressed only under 
a desired growth, hormonal, or environmental condition, 
then the target RNA should also be selectively 
expressed under such conditions. 

It is not necessary that expression of the 
ribozyme itself be selectively limited to a desired 
growth, hormonal, or environmental condition if the 
substrate for such ribozyme is not otherwise present in 
the host as the ribozyme itself is not translated by 
the host. Thus, sequences encoded by the RNA donated by 
the ribozyme of the invention are not translated in a 
host until the trans-splicing event occurs and such 
event may be controlled by the expression of the 
ribozyme substrate in the host. 

If desired, expression of the ribozyme may be 
engineered to occur in response to the same factors 
that induce expression of a regulated target, or, 
expression of the ribozyme may be engineered to provide 
an additional level of regulation so as to limit the 
occurrence of the trans-splicing event to those 
conditions under which both the ribozyme and target are 
selectively induced in the cell, but by different 
factors, the combination of those factors being the 
undesired event. Such regulation would allow the host 
cell to express the ribozyme 's target under those 
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conditions in which the ribozyme itself was not co- 
expressed . 

The sequence of the ribozyme domain that 
hybridizes to the target RNA is determined by the 
sequence of the target RNA. The sequence of the target 
RNA is determined after cloning sequences encoding such 
RNA or after sequencing a peptide encoded by such 
target and deducing an RNA sequence that would encode 
such a peptide. Cloning techniques known in the art 
may be used for the cloning of a sequence encoding a 
target RNA. 

The selection of a desired sequence to be trans- 
spliced into the target RNA (herein termed the "trans- 
spliced sequence") will reflect the purpose of the 
trans-splicing. If a trans-splicing event is desired 
that does not result in the expression of a new genetic 
sequence, then the trans-spliced sequence need not 
encode a translatable protein sequence. If a trans- 
splicing event is desired that does result in the 
expression of a new genetic sequence, and especially a 
new peptide or protein sequence, then the trans-spliced 
sequence may further provide translational stop codons, 
and other information necessary for the correct 
translational processing of the RNA in the host cell. 
If a specific protein product is desired as a result of 
the trans-splicing event then it would be necessary to 
maintain the amino acid reading frame in the resulting 
fusion. 

The identification and confirmation of the 
specificity of a ribozyme of the invention is made by 
testing a putative ribozyme 1 s ability to catalyze the 
desired trans-splicing reaction only in the presence of 
the desired target sequence. The trans-splicing 
reaction should not occur if the only RNA sequences 
present are non-target sequences to which such ribozyme 
should not be responsive (or less responsive) . Such 
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characterization may be performed with the assistance 
of a marker such that correct (or incorrect) ribozyme 
activity may be more easily monitored. In most cases 
it is sufficient to test the ribozyme against its 
intended target in vitro and then transform a host cell 
with it for study of its in vivo effects. 

When it is desired to eliminate a host's RNA, such 
elimination should be as complete as possible. When it 
is desired to provide a new genetic sequence to a host 
cell, the trans-splicing reaction of the invention need 
not be complete. It is an advantage of the invention 
that, depending upon the biological activity of the 
peptide that is translated from such genetic sequence, 
the trans-splicing event may in fact be quite 
inefficient, as long as sufficient trans-splicing 
occurs to provide sufficient mRNA and thus encoded 
polypeptide to the host for the desired purpose. 

Transcription of the ribozyme of the invention in 
a host cell occurs after introduction of the ribozyme 
gene into the host cell. If the stable retention of 
the ribozyme by the host cell is not desired, such 
ribozyme may be chemically or enzymatically synthesized 
and provided to the host cell by mechanical methods, 
such as microinjection, liposome-mediated transfection, 
electroporation, or calcium phosphate precipitation. 
Alternatively, when stable retention of the gene 
encoding the ribozyme is desired, such retention may be 
achieved by stably inserting at least one DNA copy of 
the ribozyme into the host's chromosome, or by 
providing a DNA copy of the ribozyme on a plasmid that 
is stably retained by the host cell. 

Preferably the ribozyme of the invention is 
inserted into the host's chromosome as part of an 
expression cassette, such cassette providing 
transcriptional regulatory elements that will control 
the transcription of the ribozyme in the host cell. 
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Such elements may include, but not necessarily be 
limited to, a promoter element, an enhancer or UAS 
element, and a transcriptional terminator signal. 
Polyadenylation is not necessary as the ribozyme is not 
translated- However, such polyadenylation signals may 
be provided in connection with the sequence encoding 
the element to be trans-spliced. 

Expression of a ribozyme whose coding sequence has 
been stably inserted into a host's chromosome is 
controlled by the promoter sequence that is operably 
linked to the ribozyme coding sequences. The promoter 
that directs expression of the ribozyme may be any 
promoter functional in the host cell, prokaryotic 
promoters being desired in prokaryotic cells and 
eukaryotic promoters in eukaryotic cells. A promoter is 
composed of discrete modules that direct the 
transcriptional activation and/or repression of the 
promoter in the host cell. Such modules may be mixed 
and matched in the ribozyme* s promoter so as to provide 
for the proper expression of the ribozyme in the host. 
A eukaryotic promoter may be any promoter functional in 
eukaryotic cells, and especially may be any of an RNA 
polymerase I, II or III specificity. If it is desired 
to express the ribozyme in a wide variety of 
eukaryotic host cells, a promoter functional in most 
eukaryotic host cells should be selected, such as a 
rRNA or a tRNA promoter, or the promoter for a widely 
expressed mRNA such as the promoter for an actin gene, 
or a glycolytic gene. If it is desired to express the 
ribozyme only in a certain cell or tissue type, a cell- 
specific (or tissue-specific) promoter elements 
functional only in that cell or tissue type should be 
selected . 

The trans-splicing reaction is chemically the same 
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whether it is performed in vitro or in vivo. However, 
in vivo, since cofactors are usually already present in 
the host cell, the presence of the target and the 
ribozyme will suffice to result in trans-splicing. 

The trans-splicing ribozymes and methods of the 
invention are usful in producing a gene activity useful 
for the genetic modification, and/or cell death, of 
targeted cells. For example, the trans-splicing 
reaction of the invention is useful to introduce a 
protein with toxic properties into a desired cell. The 
susceptibility of cells will be determined by the 
choice of the target RNA and the regulatory controls 
that dictate expression of the ribozyme. For example, 
a ribozyme that transposes an RNA sequence encoding a 
toxic protein may be engineered so that expression of 
the ribozyme will depend upon the characteristics of an 
operably-linked promoter. In a highly preferred 
embodiment, diptheria toxin peptide A is encoded by 
that part of the ribozyme that is transposed into a 
desired target in the host. Conditional expression of 
the ribozyme and diphtheria toxin peptide A chain 
results in the death of the host cell. Other useful 
peptide toxins include ricin, exotonin A, and herpes 
thymidine kinase (Evans, G.A., Genes & Dev. 3:259-263 
(1989)). in addition, various lytic enzymes have the 
potential for disrupting cellular metabolism. For 
example, a fungal ribonuclease may be used to cause 
male sterility in plants (Mariani, C. et al.. Nature 
347-737-741 (1990)). Particular tissues might be 
destroyed due to limited expression of the target RNA. 
Further, if a viral RNA is used as target, new forms of 
virus resistance, or therapies may be engineered. 

A binary system for control of tissue-specific 
gene expression and/or for ectopic ablation may be 
designed using the ribozymes of the invention. For 
example, lines of Drosophila that express the yeast 
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transcription activator GAL4 in a tissue and spatial- 
specific pattern using P-element enhancer-trap vectors 
may be used. Any transcriptional activator may be used 
in place of GAL4 and the invention is not intended to 
be limited to GAL4 ; A gene encoding a fusion ribozyme 
that is capable of trans-splicing the DTA sequence may 
be placed under the control of the GAL4-UAS promoter 
and inserted into Drosophila in a genetically stable 
manner. Such ribozymes will not be expressed in 
Drosophila in the absence of GAL4. Accordingly, 
crossing Drosophila hosts genetically carrying this 
ribozyme construct with Drosophila hosts that express 
GAL4 in a tissue-specific manner result in progeny hat, 
when GAL4 expression is induced, exhibit a pattern of 
cell death similar to the pattern of GAL4 expression. 

In addition, by targetting the ribozyme to trans- 
splice with the GAL4 mRNA, the splicing activity of 
the ribozyme inactivates GAL4 expression and ribozyme • 
expression may be self-regulated. 

Pro-ribozyaes 

A trans-splicing ribozyme, as described above, 
consists of three fused sequence elements - a 5' "anti- 
sense" region which is complementary to the target RNA, 
the catalytic region which is based on a self-splicing 
Group I intron, and 3' "exon" sequences. The 5' region 
can base pair with the chosen target RNA, to bring it 
into proximity with the catalytic sequences of the 
Group I intron. The structure of the Group I intron 
provides a chemical environment suitable to catalyze 
the precise splicing of the target RNA with the 3' 
"exon" sequences. However, in the absence of the 
appropriate target RNA, the ribozyme sequences can 
still catalyze scission at the 3' "exon" junction 
(similar hydrolysis is seen for Group I self-splicing 
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intons (Zaug et al.. Science 231:470-475 (1986)), and 
may be able to catalyze illegitimate splicing events 
through transient base-pairing of the ribozyme with 
heterologous RNA sequences (which Bay include their 
own) . such side-reactions and illegitimate splicing 
events are unwanted, and may be deleterious. For 
example, if trans-spl icing is to be used for 
conditional delivery of a toxin in vivo, illegitimate 
trans-splicing might result in unexpected expression of 
the toxic activity. Spontaneous cleavage at the 3' 
••exon" junction would lower the efficiency of trans- 
splicing. 

To help avoid these problems, "pro-ribozyme" forms 
of the trans-splicing RNAs have been constructed 
wherein for example, helix P8 is disrupted. The pro- 
ribozymes are constructed to contain extra self- 
complementary sequences which cause the catalytic 
center of the ribozyme to be mis-folded. The pro- 
ribozymes are inactive in the absence of the intended 
target RNA; active forms are only formed after base- 
pairing of the ribozyme and target RNAs - with 
consequent displacement of the interfering secondary 
structure within the ribozyme. Pro-ribozymes are 
intended to be catalytically inert species in the 
absence of the target RNA, to eliminate unwanted self- 
cleavage, self-splicing and illegitimate trans-splicing 
reactions in vitro and in vivo (Figure 12). 

The pro-ribozymes described here are conformation- 
al^ disrupted and therefore inactive forms of the 
trans-splicing activities. Thus the pro-ritozymes 
possess little self-cleavage activity. They are only 
re-activated by specific interaction with the target 
RNA, and thus are substrate-activated ribozymes which 
are' less likely to catalyze trans -splicing to an 
unintended target RNA. Trans-splicing ribozymes are 
intended to be used for the delivery of new gene 
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activities in vivo, and any reduction in the extent of 
unwanted side reactions or illegitimate splicing is 
desirable, and may be necessary. 

While the disruption of helix P8 has been 
exemplified here for the trans-splicing pro-ribozyroes, 
other helices which are required for catalytic activity 
could also have been used. 

The same approach, of disrupting the conformation 
of a catalytically important structure in such a way 
that only base-pairing with the intended substrate RNA 
will allow the formation of an active ribozyme, could 
be applied to other ribozyme designs. For example, the 
loop sequence' of a "hammerhead" type endoribonuclease 
(Haseloff et al . , Nature 334:585-591 (1988)) could be 
extended and made complementary to one of the "anti- 
sense" arms of the ribozyme - similar to the above 
modification of helix P8. Endoribonuclease activity 
would only be exhibited after base-pairing with the 
chosen target RNA, displacement of the disrupting 
secondary structure, and reformation of the stem-loop 
structure required for catalysis. This would 
effectively increase the specificity of the ribozyme of 
its target. 

In addition, the activation of a pro-ribozyme need 
not rely on base-pairing with the substrate itself. 
Instead, a chosen third RNA or ssDNA or even protein 
might be required for activity. An additional base- 
pairing or RNA-protein interaction would be required 
for the formation of an active ribozyme complex. The 
availability of such additional components would 
determine ribozyme activity, and could be used to alter 
ribozyme selectivity. 

The ribozyme or pro-ribozyme of the invention may 
be introduced into any host cell, prokaryotic or 
eukaryotic and especially into a plant or mammalian 
host cell, and especially a human cell, either in 
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culture or in vivo, using techniques known in the art 
appropriate to such hosts. The ribozymes of the 
invention may also be engineered to destroy viruses. 
In one embodiment, the ribozyme or pro-ribozyme of the 
invention is provided in a genetically stable manner to 
a host cell prior to a viral attack. Infection by the 
appropriate virus, or expression of the latent virus in 
such host cell, (resulting in the appearance of the 
ribozyme «s or pro-ribozyme target RNA in the host 
cell), would stimulate the catalytic activity of the 
ribozyme and destruction of the viral RNA target and/or 
production of a- toxin via trans-splicing resulting in 
death of the virus infected cells. In another 
embodiment, the ribozyme or pro-ribozyme may be 
engineered and packaged into the virus itself. Such 
embodiments would be especially useful in the design of 
viruses for investigative purposes, wherein the 
ribozyme or pro-ribozyme may be designed to destroy the 
function of a specific viral RNA and thus allow the 
study of viral function in the absence of such RNA. 
Viruses carrying ribozymes may also be used as carriers 
to transfect host cells with a desired ribozyme or pro- 
ribozyme activity. 

Male or female sterility may be engineered in 
agronoroically important species using the ribozymes or 
pro-ribozymes of the invention. For example, male 
sterility in tobacco may be engineered by targetting 
TA29 or TA13 mRNA (tobacco anther-specific genes; 
Seurinck, J. et al., Nuc2 . Acids Res. 18:3403 (1990) 
with a ribozyme or pro-ribozyme of the invention that 
trans-splices the DTA 3« exon into those targets. 

The form of crop plants may be manipulated by 
selective destruction or modification of tissues using 
the ribozymes or pro-ribozymes of the invention. For 
example, seedless fruits may be made by targetting the 
seed storage protein mRNA with a ribozyme or pro- 
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ribozyme of the invention that trans-splices the DTA 3 • 
exon into the target. 

Transgenic plants may be protected against 
infection by expression of virus-specific ribozymes or 
pro-ribozyme to kill infected cells. This would be an 
artificial form the "hypersensitive response." For 
example, cucumber mosaic virus coat protein mRNA may be 
targeted with a ribozyme or pro-ribozyme of the 
invention that trans-splices the DTA 3' exon into the 
target. 

Populations of micro-organisms may be made 
resistant to specific pathogens by introduction of 
trans-splicing ribozymes or pro-ribozymes. For 
example, cheese-making bacteria may be made resistant 
to phage infection by targetting the phage RNA with a 
bacterial toxin gene or lytic enzyme encoded by the 3' 
exon provided by the ribozyme or pro-ribozyme of the 
invention, for example, which would interfere with 
phage replication by causing premature lysis after 
phage infection. 

Virus pathogens could be constructed to deliver 
toxic activities via trans-splicing. In this way, 
specific cell types could be targeted for ablation, 
such as for cancer or viral therapy. For example, HIV 
mRNA may be targeted by a ribozyme or pro-ribozyme of 
the invention that carries the DTA 3' exon, for either 
virus or liposome delivery. 

The examples below are for illustrative purposes 
only and are not deemed to limit the scope of the 
invention. 
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EXAMPLES 



Example 1 

Construction and C haracterization of a CAT-LacZ 
Trans-Splicing Ribozvme 



I. PCR Amplification and Cloni ng of the Ribozvme of 
10 the Invention 

Following the guidelines outlined above, a trans- 
splicing fusion ribozyme was designed that will splice 
a portion of the amino-terminal coding sequence of E. 
coli 0-galactosidase (LacZ) raRNA to a site in the 
chloramphenicol acetyl transferase (CAT) inRNA (Figure 
3). The sections of new sequence flanking the T. 
thermophila ribozyme core and the 3' exon were 
synthesized as oligonucleotides. The intact ribozyme 
sequence was then assembled by successive polymerase 
chain reactions, using the synthetic adaptor 
oligonucleotides as primers with ribozyme and 0- 
galactosidase DNA templates (while there are other 
methods available, this method is most convenient). 

For the construction of a ribozyme capable of 
splicing 0-galactosidase (LacZ) o-peptide coding 
sequence to a site in the 5' coding sequence of the 
chloramphenicol acetyl transferase (CAT) , three 
30 oligonucleotides were synthesized. 

5^-GGCCA 1 AGCT^ CTTTA CGATG CCATT GGGAT ATATC AACGG 
TGGTA TAAAC CCGTG GTTTT TAAAA GTTAT CAGGC ATGCA CC-3 ' 
35 [SEQ ID NO. 2] 

oliaonucleotide 2 

5 * -GATTA GTTTT GGAGT ACTCG TACGG ATTCA CGGCC GTCGT 



TTTAC AA-3' [SEQ ID NO. 3] 



40 



Oligonucleotide 3 

5'-GGCCG AATTC TTACA ATTTC CATTC AGGCT GCGCA ACTGT TGG- 
3' [SEQ ID NO. 4] 

Oligonucleotides 2 and 3 (200 pinoles each) were 
combined with 0.1 jig PvuII-cut pGEM4 DNA (which 
contained the LacZ or-peptide sequence) , and subjected 
to PCR amplification in a volume of 100 til containing: 

50 mM KC1, 

10 mM Tris-HCl pH 8.3, 

1.5 mM MgCl 2 , 

0.4 mM dNTPs, 

0.1% gelatin, and 

5 U TagI DNA polymerase, 

and incubated for 30 cycles, 1 min § 94 °C, 2 mins § 
50°C, 2 mins § 72°C. 

Plasmid pGEM4 is commercially available from 
Promega Corporation, Madison WI, USA. 

The amplified product of 210 base-pairs was 
purified using low-gelling temperature agarose 
electrophoresis, and was used as primer in a second 
round of PCR amplification. 

Following the second round of PCR amplification, 
2.0 fxg of 210 base-pair amplified product, 200 pmoles 
oligonucleotide 1 and 0.1 /xg 4 50 base-pair fragment 
containing the T. thermophila IVS were mixed and 
subjected to PCR amplification using the conditions 
shown above. The resulting 660 base-pair product was 
digested with the restriction endonucleases EcoRI and 
Hindlll, and cloned into the plasmid vector pGEM4 ♦ The 
complete sequence of the CAT-LacZ a-peptide ribozyme 
DNA sequence is presented as SEQ ID NO. 5 and Figure 
3B. 

The cloning vector containing the cloned sequences 
was transformed into, and propagated in, the bacterial 
host XLl/Blue (Strategene, La Jolla, California), using 
techniques known in the art (Maniatis, Molecular 
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Cloning, A Laboratory Guide, 2nd edition, 1989, Cold 
Spring Harbor Laboratory, Publishers). However, any 
bacterial host capable of stably maintaining the vector 
may be used, for example the JM109. 

The plasmid may be extracted from the host cell 
for further analysis using techniques commonly known in 
the art (Maniatis, Molecular Cloning, A Laboratory 
Guide, 2nd edition, 1989, Cold Spring Harbor 
Laboratory, Publishers) . 

jj. jry vitro Tenser lotio n of Cloned Fibpzyme and 
7»a y get RNAs 

Using standard procedures, cloned sequences were 
purified from the bacterial host and the plasmid 
linearized using a restriction endonuclease that does 
not cut the'ribozyme sequence, (for example, ScoRI), 
and transcribed using T7 RNA polymerase in a volume of 
100 ill, containing: 

5 fig linearized plasmid DNA, 
40 mM Tris-HC pH 7.5, 

6 mM MgCl 2 , 

2 mM spermidine, 
lOmM NaCl, 

iSTSSi (containing 20 >Ci {a-«P]OTP, if 

labelled RNA transcripts were desired) , 
100 U RNasin, and 
50 U T7 RNA polymerase, 

and the reaction was incubated at 37 »C for 2 hours. 



.RNA transcripts were purified by 5% polyacrylamide 
gel electrophoresis before use (TBE, 7M urea gel). 
RNAs containing active T.thermophila IVA sequences 
undergo some spontaneous scission at the 3' intron-exon 
junction during transcription. Fragments are removed 
by electrophoretic purification for clarity of analysis 
during subsequent trans-splicing assays. 
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III. In Vitro Trans-splicina Reactio n Conditions 

Target and/or trans-splicing ribozymes are 

incubated under the following conditions: 

O.i-0.5 RNA component (amount depends on type of 
experiment, usually ribozyme in 5-fold excess of 
target) , 

30 xnM Tris-HCl pH 7.5, 
100 mM NaCl, 
2mM GTP, 
5 mM MgCl 2 , 

in a volume of 5 nl at 42 °C, 60 mins. 

The reaction is diluted with 95 nl 0.1 mM Na 2 EDTA, 
200 mM NaCl, and ethanol pre.cipated. The RNAs are then 
analysed on 5% polyacrylamide gels containing TBE 
buffer, 7M urea and 25% formamide, and 
autoradiographed . 

IV. Assay of Endonucleolvtic Activity 

After base-pairing of the ribozyme and target, the 
first step in trans-splicing is the guanosine mediated 
cleavage of the target RNA at the intended 5' splice 
site. Annealing and trans-splicing may be performed in 
a buffer such as 30 mM Tris-HCl, pH 7.5, 100 mM NaCl, 
5 mM MgCl 2 , 2 mM GTP at 42°C. As the 3' splice site is 
dispensable for this reaction, truncated trans-splicing 
ribozymes should behave as highly-specific 
endoribonucleases. To test this activity, shortened in 
vitro transcripts of the CAT-LacZ a-peptide trans- 
splicing ribozyme described above (SEQ ID NO. 5 and 
Figure 3) were incubated with CAT mRNA sequences. The 
CAT-LacZ ribozyme cassette is on a HincJIII-EcoRI 
fragment. The Seal cleavage site marks a position 5 
bases upstream of the 3' splice site. The ribozyme 
specifically cleaved the target RNA at the expected 
single site to produce the expected size fragments. 
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V. Jhe rrans-spl ic ing Reaction 

To confirm the ability of the CAT-LacZ o-peptide 
ribozyme to catalyze the ligation of 3« exon sequences 
at the 5' splice site, various forms were incubated 
with radiolabeled CAT RNA. Ribozyme transcripts were 
synthesized from DNA templates which had been 3' 
truncated at one of several positions, ranging from the 
end of the ribozyme core through the exon sequence, 
incubation with labelled CAT led to the formation of 
the expected spliced products, which differed in length 
depending on the extent of 3' exon sequence. 

In addition, a certain proportion of the CAT-LacZ 
apeptide ribozyme molecules underwent spontaneous 
cleavage at the 3' splice site during in vitro 
transcription, similar to the intact T. thermophila 
intron. These cleaved forms, terminated at the 
guanosine residue adjacent the 3' splice site, were 
also incubated with CAT RNA. In this case, the ribozyme 
itself is ligated to a 3' portion of the CAT RNA, to 
produce a product of about 550 nucleotides in size. 
This reaction is similar to the self-circularization of 
the intact intron, and the same ligation product is 
found in the other trans-splicing reactions. 

VI. accuracy " f the Tr ans-splicing 

The products from a CAT-LacZ a-peptide trans- 
splicing reaction were reverse-transcribed, and 
amplified by polymerase chain reaction using two 
oligonucleotides complementary to sequences on either 
side of the predicted splice sites. Amplified 
sequences were cloned and sequenced. Individual 
recombinants showed no variation from the expected 
sequence of the spliced products. As found in studies 
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with the intact intron, splicing appears to be highly 
accurate. 

Accordingly, the studies above show that a trans- 
splicing ribozyme designed according to the guidelines 
of the invention is capable of accurate, effective 
trans-splicing in vitro. 

Example 2 

Design of a Trans-Sol icina Ribozym e that Provides 
Plant Vir us Resistance 

Cucumber mosaic virus (CMV) is a pandemic virus 
with a large number of known strains. Nine sequence 
strains are shown in the region of the start of their 
coat protein cistron encoded in RNA 3 and the 
subgenomic mRNA 4 (SEQ ID NOS. 7-25; Figures 4(A) and 
5) . Two sites have been chosen which are conserved in 
sequence and downstream from the AUG start codon of the 
coat protein. Oligonucleotides for the construction of 
ribozymes capable of trans-splicing the ile-mutant form 
of DTA into the CMV coat protein mRNA are shown in 
Figure 4B and is discussed below. 

The trans-splicing ribozymes shown in Figure 4C 
and D are targetted to the CMV virus sequences shown in 
Figure 4B and will result not only in the cleavage of 
the CMV RNA molecules but in the expression of 
diphtheria toxin A-chain in the infected cell. The 
trans-splicing cassettes shown in Figure 4 may be 
transformed into any CMV-susceptible plant species 
using techniques known in the art, and transgenic 
progeny challenged by CMV infection. The design of the 
ribozyme is such that virus infection is necessary to 
initiate toxin production via RNA trans-splicing 
because the ribozyme itself is not translated. The 
localized death of the infected cells that results from 
expression of the toxin could limit replication and 
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spread of the virus within the plant giving 
artificial hypersensitive response. 



pyample 3 

rnnctr»rti 0 n and rharacteri *ati on of a 
pT ^i^-Ha Toxin A fhain Tran^-SpH n ng pibozyme 

According to the invention and the methods 
described in Example 1, a fusion ribozyme has been 
designed that is a Gal4-Diphtheria toxin A chain trans- 
splicing ribozyme (Figure 6). The sequence of this 
ribozyme is shown as SEQ ID NO. 6. The GAL4-DTA 
ribozyme cassette is a Sal I-Xhol fragment. The Seal 
site marks a position 5 bases upstream of the 3 « splice 
site. This ribozyme is capable of splicing the coding 
sequence for the A chain of the diphtheria toxin to a 
site in the 5' region of the GAL4 mRNA. This trans- 
splicing activity is active both in vitro (as above) 
and in vivo (below) . The major criteria for successful 
design of the GAL4-DTA ribozyme, and any trans-splicing 
ribozyme that trans-splices a sequence encoding a toxic 
product, are not only the efficient and precise 
catalysis of trans-splicing, but also that expression 
of the toxic product, for example, DTA does not occur 
in the absence of trans-splicing. 

The catalytic portion of the ribozyme is 
constructed according to the design outlined above, and 
5' and 3' splice sites chosen within the 5« coding 
regions of GAL4 and DTA, respectively. The 3* exon 
sequence corresponds to that of a DTA gene already used 
for expression in eukaryotes, except for the removal of 
the first AUG codon and several proximal amino acids. 
The original C. diphtheriae form of DTA also differs in 
this 5' region, utilizing a CUG codon for translation 
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initiation. The original DTA sequence also contains a 
signal peptide leader sequence which is absent. 

These ribozyme molecules can undergo spontaneous 
scission at the 3' splice site. Given the extreme 
toxicity of DTA, it is important that any liberated 3' 
exon sequences not give rise to toxic translation 
products. The 3 1 exon contained an in-frame methionine 
at position 13, which could conceivably give rise to a 
truncated but toxic polypeptide. To eliminate this 
possibility, the wild-type sequence (Rz-DTA^t) was 
altered from methionine at this position to isoleucine 
(Rz-DTA ile ) or leucine (Rz-DTA leu ) in two separate 
ribozyme constructions (Figure 6) . 

Example 4 

in Vivo Activity of the R ibozvmes of the Invention 

I. introduction 

The in vivo activity of a ribozyme designed 
according to the guidelines provided herein, and the 
ability of such a ribozyme to deliver new gene 
activities to host cells, was demonstrated using the 
Gal4-Diphtheria toxin A chain trans-splicing ribozyme 
described (Example 3 and in Figure 6) to deliver the 
highly toxic diphtheria toxin A product to a host cell. 
In this system, Drosophila was the chosen host and it 
was desired to control expression of the ribozyme of 
the invention in a tissue-specific manner within the 

Drosophila host. 

Diphtheria toxin is secreted by Corynebacterium 
diphtheriae lysogenic for B phage. The toxin is 
produced as a single polypeptide which undergoes 
proteolysis to produce A and B chains. The A chain 
(DTA) contains a potent ADP ribosylase activity which 
is specific for the eukaryote translation elongation 
factor EF-2. The presence of even a few molecules of 
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this enzyme is enough to cause cessation of translation 
and eventual death in a variety of eukaryote cells. 
The B chain allows intracellular delivery by attachment 
of the toxin to cell surface receptors by binding 
mannose residues, is endocytosed and enters the 
cytoplasm by vesicular fusion. 

In the absence of the B-chain, the A-chain is much 
less toxic when present extracellular ly. This 
property, and its extreme toxicity, have suggested its 
use for ectopic ablation experiments. For example, 
sequences encoding DTA have been expressed in 
transgenic mice, using an opsin promoter to drive 
expression in developing eyes. The resulting mice are 
blind, with deformed eyes (Breitman, M.L., Science 
238:1563-1565 (1987)). In other studies, ablation of 
the mouse pancreas was performed (Palmiter, R.D. et 
al., Cell 50:435-443 (1987)) and Wert, S.E. et al., Am. 
Rev. Respir. Dis. 141 fno. 4, part 2) :A695 (1990) 
described ablation of alveolar cells by use of a 
chimeric gene consisting of the promoter and 5- 
flanking sequence of the human surfactant protein C 
gene (expressed in type II alveolar cells) and the DTA 
gene. 

However, using this type of approach, it is not 
possible to maintain or propagate transformed organisms 
which might have more severe, or lethal phenotypes. In 
addition, transformation of certain species, such as 
Drosophila, with intact DTA sequences has not been 
reported to date. Leaky expression of the DTA gene 
during such transformations leads to immediate death. 

II . The Drosphila System 

A general method for targeting gene expression in 
Drosophila has been developed. First, the system 
allows the rapid generation of individual strains in 
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which ectopic gene expression can be directed to 
different tissues or cell types: the enhancer detector 
technique is utilized (O'Kane, C.J. and Gehring, W.J., 
Proc. Natl. Acad. Sci. USA:9123-9127 (1987); Bellen et 
al., Genes and Development 3:1288-1300 (1989); Bier et 
al., Genes and Development 3:1273-1287 (1989)) to 
express a transcriptional activator protein in a wide 
variety of patterns in embryos, in larvae and in 
adults. Second, the method separates the activator 
from its target gene in distinct lines, to ensure that 
the individual parent lines are viable: in one line 
the activator protein is present but has no target gene 
to activate, in the second line the target gene is 
silent. When the two lines are crossed, the target 
gene is turned on only in the progeny of the cross, 
allowing dominant phenotypes (including lethality) to 
be conveniently studied. 

To ectopically express only the gene of interest, 
a transcriptional activator that has no endogenous 
targets in flies is required. An activator from yeast, 
Gal4, can activate transcription in flies but only from 
promoters that bear Gal4 binding sites (Fischer et al . , 
Nature 332:853-865 (1988)). To target gene expression, 
Gal4 is restricted to particular cells in two ways: 
either Gal4 transcription is driven by characterized 
fly promoters, or an enhancer less Gal4 gene is randomly 
integrated in the Drosophila genome, bringing it under 
the control of a diverse array of genomic enhancers. 
To assay transactivation by Gal4, flies that express 
Gal4 are crossed to those bearing a lacZ gene whose 
transcription is driven by Gal4 binding sites (Fischer 
et al . , Nature 332:853-865 (1988)). £-galactosidase is 
expressed only in those cells in which Gal4 is first 
expressed. Tissue- and cell-specific transactivation 
of lacZ has been demonstrated in strains in which Gal4 
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is expressed and in which a variety of patterns are 
established. 

With this system, it is now possible: 1) to place 
Gal4 binding sites upstream of any coding sequence; 2) 
to activate that gene only within cells where Gal4 is 
expressed and 3) to observe the effect of this aberrant 
expression on development. In cases where ectopic 
expression is lethal, this method allows the two parent 
lines (one expressing Gal4, the other carrying a silent 
gene bearing Gal4 binding sites in its promoter) to be 
stably propagated. Phenotypes can then be studied in 
the progeny of a cross. 



III. vectors 

The vectors utilized as starting materials in 

these studies include: 

D r r.aTR and dGATN (figure 9): These vectors 
are used for cloning promoters and enhancers upstream 
of a promoterless Gal4 gene. 

Vectors were constructed in which either a unique 
jffotl or BamHI site is inserted upstream of the Gal4 
coding region. Once a promoter has been linked to the 
Gal4 coding sequence, the gene can be excised from the 
pHSREM vector backbone (Knipple and Marsella-Herrick, 
Nucl. Acids Res. 16:7748 (1988)) and moved into a P- 
element vector. The Rh2 promoter has been cloned 
(Mismer et al . , Genetics 120:173-180 (1988)) into this 
vector and flies have been generated in which Gal4 is 
expressed only in the ocelli. 

2) pGawB : This is a Gal4 vector for use in 

enhancer detection. 

An enhancerless Gal4 gene was subcloned into the 
vector plwB (Wilson et al . , Genes and Development 
3:1301-1313 (19B9)) to create pGawB . plwB was first 
digested with to remove the lacZ gene and the 

N-terminus of the P-transposase gene. These were 
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replaced with the entire Gal4 coding region behind the 
TATA box of the P-transposase gene. 

3) pUAST (Figure 10) : This plasroid was used for 
cloning coding sequences downstream of the Gal UAS. 

A vector into which genes can be subcloned behind 
the Gal4 UAS (Upstream Activation Sequence) was 
constructed in the P-element vector, pCaSpeR3 (C. 
Thummel, Univ. of Utah Medical Center, Salt Lake City, 
Utah, personal communication) • Five Gal4 binding sites 
were inserted, followed by the hsp70 TATA box and 
transcriptional start, a polylinker, and the SV40 
intron and polyadenylation site. Unique sites into 
which genes, or cDNAs, can be inserted include: EcoRI, 
Bglll, Wot I, Xhol, Kpnl and XJbal. 

IV. Drosophila Strains 

The genetic techniques described herein used to 
characterize the strains of Drosophila utilized in 
these studies are well known in the art ("Genetic 
Variations of Drosophila melanogaster D. Lindsley and 
E.H. Grell, eds) . 

The P-element transposons are mobilized using the 
" jumpstarter" strain that carries A2-3, a defective 
P-element on the third chromosome that expresses high 
levels of a constitutively active transposase 
(Robertson et al . , Genetics 218:451-470 (1988)). The 
three stocks currently used to generate and map the 
insertion lines were deposited in the Drosophila Stock 
Center, Indiana University Department of Biology, 
Jordan Hall A 503, Bloomington, Indiana 47405: 
1: y w; +/+; Sb P[ry\ A2-3J/TM6, Ubx 
2: w; + / + ; TM3, Sb/CxD (deposit no. 3665) 
3: w; CyO/Sco; +/+ (deposit no. 3666) 
where the genetic characteristics of the three 
chromosomes are separated by semicolons. Thus, for 
example, in strain 1, the first chromosome (the X 
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chromosome) is homozygous for yellow and white («y w») , 
the second chromosome is wild-type («+/+"), and the 
third chromosome carries the stubble gene («Sb«) , and 
the P element transposon rosy gene («ry + «) and A2-3, 
while the second third chromosome carries balancer 
inversions ( M /™6, Ubx"). 

V. fi-rateav fo^ ftgneratin " r, a T4 Exprpsgion patterns 

A . tteheme uppH to is olate trfnsf onnants 
Constructs are injected into embryos derived from the stock; 
99 y w/y w ; A2-3 ,Sb/TM6,Ubx X 66 y w/Y ; A2-3 , Sb/TM6,Ubx 
fO; Establish si ngle lines 

9 y w/y, w ; A2-3,Sb/TM6,Ubx X 6 y w/Y; +/+ 



or 



6 y w/Y ; A2-3,Sb/TM6 f Ubx X 9 y w/y w; +/+ 

fy select and r p VyM orooenv and establish stocks , 

9 y w/y w ; +/TM6,Ubx X 6 y w/Y; +/+ 

or 

5 y w/Y ; + /TM6,Ubx X 9 y w/y w; +/+. 



B 



grh»«».s useH ir> -jump tb » enhancer] ess Gal 4 insert 



3 T» mpc fr om the Y-chromosome 

99 FM3/FM7,w; +/ + ^ y w/Y; A2-3,Sb/TM6,UbX 

99 FM7 ,w/ P[Gal4,w*l * 66 FM7/Y ; A2-3,Sb/+ 

99 FM7,w/ P[Gal4,w*]; A2-3,Sb/ + ^X 66 y w/Y; +/+ 

9 FM7,w/ y w ; A2-3,Sb/ + X 6 y w/Y; +/+ 

Select [w + ] and [B] progeny and establish stocks 

?. Jumps the A ?-^-chromosome 

99 y w/y w X d<J y w/Y ; P[Gal4,w + ], A2-3,Sb/+ 
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Select [w 4 ] and [Sb + ] progeny and establish stocks. 

C. Chromosomal segregation 

To analyze the segregation of the insertions two stocks are used 
w;+/+; TM3, Sb/CxD and w; CyO/Sco;+/+. 

Method 

To create a large number of strains that express 
Gal4 in a cell- or tissue-specific manner enhancer 
detection vectors have been built that carry different 
versions of the Gal4 gene. Two genes, encoding either 
the full-length protein or a truncated protein, have 
been cloned into rosy (ry + ) and white (w + ) P-element 
vectors (modified versions of plArB and plwB; Wilson et 
al., Genes and Development 3:1301-1313 (1989)). Using 
ry* or w* as a screen, these vectors have been 
mobilized by introduction of the A2-3 gene (Robertson 
etaJ., Genetics 218:461-470 (1988)). To visualize the 
expression pattern of Gal4, the Gal4 insertion lines 
are crossed to a strain that carries the lacZ gene 
under the control of the Gal4 UAS (Fischer et al . 
Nature 332:853-865 1988). Embryos, larvae and adults 
derived from these crosses are screened for j8- 
galactosidase expression either by an enzyme assay, 
with X-gal as a substrate, or by staining with 
monoclonal antibodies against 0-galactosidase. jS- 
galactosidase encoded by the UAS-iacZ construct is 
localized in the cytoplasm. 

Approximately 500 Gal4-insertion strains have been 
screened and many that can be used to activate genes in 
specific tissues have, been identified such as, for 
example, epidermal stripes, mesoderm, the central 
nervous system and the peripheral nervous system. Many 
of the lines express jS-galactosidase in the salivary 
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glands as well as in other tissues. It is possible 
that in constructing the enhancerless-Gal4 transposon 
a position-dependent salivary gland enhancer was 
fortuitously generated. 



VI. Sample Screen 





No Staining 


Salivarv Gland 


Other Tissues 


% 


f of Strains 
9 


+ 






5.8 


45 




«+ 




28.8 


81 




■f 


+ 


51.9 


21 








13^5 


156 








LOO-0 



a Gal4 insertion line is selected and crossed to a 
strain that carries Gene X cloned behind the GAL UAS. 

Vil. c.^r-v of ^ r,AT.*/UftS System without the 
Ribozvme 

The Gal4/UAS system is a two-part system for 
controlling gene activatiion. The method is versatile, 
can be tissue-specific and does not appear to exhibit 
a basal level of expression except perhaps, as 
described herein, for a UAS-DTA construct. It can be 
used to ectopically express characterized genes, to 
express modified genes that would otherwise be lethal 
to the organism and to express genes from other species 
to study their effect on Drosophila development. Since 
the method makes it possible to produce dominant, gain- 
of-f unction mutations, epistasis tests and screens for 
enhancers or suppressors of visible or lethal 
phenotypes can be carried out. The Gal4 system also 
allows the expression of toxic products to study the 
consequences of cell- and tissue-specific ablation. 
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VIII. Use of Gal4-Expressina Drosophila with the 

pTA Ribozyme of the Invention 

Expression of the fusion ribozyme carrying the 
sequences encoding the DTA protein was placed under the 
control of a the GAL4 UAS (upstream activator sequence) 
in pUAST (Figure 10). As stated supra, using modified 
p-eleroent enhancer-trap vectors described above, a 
large number of stable lines of Drosophila were 
constructed which each express the yeast 
transcriptional activator GAL4 in specific spatial and 
temporal patterns in the developing flies. Any gene 
under the control of the GAL4 upstream activator 
sequence (UAS) can be transformed and maintained 
singly, then induced in particular Drosophila tissues 
by genetic crossing to lines which express GAL4 (Figure 
7). However, it was not possible to take advantage of 
the Gal4 system for expression of DTA per se without 
further modification, due to the difficulty in 
producing UAS -DTA transf ormants through leaky 
expression of the DTA. 

It was found that use of this two-element system 
as a means of conditionally expressing DTA via a trans- 
splicing ribozyme (Figure 6) overcame these problems. 
In those cells expressing GAL4 , the GAL4 protein 
provides the activity necessary for ribozyme 
transcription, and the GAL4 mRNA provides the target 
for trans-splicing necessary for DTA production. 

Drosophila embryos may be injected with ribozyme 
sequences placed under the control of a UAS promoter 
as described above, using techniques known in the art. 
Embryos injected with the Rz-DTA met construction will 
not survive, whereas normal transformed flies were 
obtained from embryos injected with both Rz-DTA ile and 
Rz-DTA leu . This result suggested that the internal AUG 
codon was indeed acting as an initiation codon for the 



WO 92/13089 



PCT/US92/00277 



- 52 - 



translation of a toxic product after injection. The 
codon is. adjacent to proposed NAD+ binding site in the 
DTA sequence, and to sequences conserved in the 
distantly related exotoxin A, another EF-2 specific 
APP-ribosylase from Pseudomonas aeruginosa. 

Transgenic flies containing the Rz-DTA ile and Rz- 
DTA leu sequences under control of the GAL4 UAS were 
crossed to flies producing GAL4 in particular patterns 
of expression. For example, in one characterized line, 
line 1J3, the GAL4 gene was been inserted near the 
hairy gene, and mirrored its pattern of expression. 
The hairy gene product is produced in epidermal stripes 
in the even-numbered abdominal segments during 
embryogenesis. When a UAS-driven LacZ gene was 
introduced into 1J3 in which GAL4 is expressed in the 
same pattern as the hairy gene product, 0-galactosidase 
was found localized within the even-numbered stripes. 
When flies containing the Rz-DTA leu gene were crossed 
to this GAL4 -expressing line, normal progeny resulted. 
However, when flies containing Rz-DTA ile were crossed 
to the GAL4-expressing line, development of the progeny 
was arrested in embryogenesis. Darker colored bands 
were evident on the cuticles of the embryos, consistent 
with the death of underlying cells. When cuticle 
preparations were examined, the even-numbered denticle 
bands were disrupted or missing, particularly those of 
the 4th, 6th and 8th stripes (Figure 11). Other 
specific patterns of cell death were observed when the 
containing Rz-DTA ile flies are crossed to different 
GAL4 expressing genes. 
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Example 5 
Design of Pro-ribozymes 

5 As a test for the design of pro-ribozymes, the 

CAT-LacZ trans-spl icing ribozyme which described 
earlier was modified (Figure 2). Phylogenetic 
comparisons and mutational analysis (for review, see 
Cech, Ann Rev. Biochem. 59:543-568 (1990)) have 

10 indicated that a core region of the group I self- 

splicing introns is highly conserved and important for 
activity (Figure 8). For the construction of trans- 
splicing pro-ribozymes a helix immediately adjacent to 
this region, P8, was disrupted. In the first 

15 experiments, 13 or 18 nucleotides of new sequence were 

introduced into the 5' strand and loop of helix P8, to 
produce pro-ribozyme 1 and 2, respectively. The extra 
nucleotides were complementary to the 5' "anti-sense" 
portion of the ribozyme, while the flanking sequences 

20 were adjusted to conserve (1) the actual sequences at 

the base of P8, and (2) the extent of base-pairing 
possible within P8 (Figure 13). The extent of self- 
complementarity between the sequences inserted into 
helix P8 and the 5' "anti-sense" region of the pro- 

25 ribozyme is such that this new helix would be expected 

to form in nascent transcripts, in preference to helix 
P8. The formation of this alternative helix would also 
be expected to disrupt flanking secondary and perhaps 
tertiary interactions within the catalytic core of the 

30 ribozyme. Thus, mis-folding of the pro-ribozyme would 

render it catalytically inactive (Figure 14) . However, 
base-pairing of the pro-ribozyme with the intended 
target RNA would displace the P8-"anti-sense" base- 
pairing, sequester the "anti-sense" sequences and allow 

35 re-formation of the P8 helix and an active catalytic 



% 
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domain. Displacement of the P8-«anti-sense« helix 
results in a greater sum of base-pairs and allows 
proper folding of the catalytic domain, so should be 
energetically favored. 

CAT-LacZ pro-ribozymes 

Cloned sequences corresponding to the two CAT-LacZ 
pro-ribozymes were constructed using PCR-mutagenesis 
as discussed above, and RNAs were produced by in vitro 
transcription. The CAT-LacZ trans-splicing ribozyme 
was observed to undergo scission during transcription 
at the 3' splice junction, as a result of hydrolysis 
catalyzed by the intron sequences. Similar hydrolysis 
is seen in in vitro transcripts of the unmodified 
Tetrahymena thermophila intron. In contrast, 

transcripts of the different CAT-LacZ pro-ribozymes are 
more stable, with little cleavage evident under the 
same conditions (Figure 15) . This indicates that the 
pro-ribozymes are inactive, which would be expected if 
the catalytic sequences were mis-folded. Truncated 
forms of the pro-ribozymes were tested for specific 
endoribonuclease activity directed against the CAT RNA. 
CAT-LacZ pro-ribozyme RNAs were transcribed from 
templates truncated at the Seal site, to remove the 3' 
splice junction and LacZ sequences. Both ribozyme and 
pro-ribozyme RNAs are stable after removal of the 3' 
splice site. Incubation of the truncated pro-ribozymes 
with CAT RNA led to specific cleavage of the target RNA 
to give fragments of the expected sizes (Figure 16). 
Specific cleavage activity was seen at 37, 45 and 50 
degrees. 

Pro-ribozyme forms of the GAL4-DTA trans-splicing 
ribozyme were also constructed (Figure 17). Regions 
of 20 nucleotides (complementary to the "anti-sense" 
region) were inserted into the 5' strand and loop of 
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helix P8. The two pro-ribozymes differed in the extent 
of base-pairing possible in the modified helices P8, 
and GAL4-DTA pro-ribozyme 1 possessing both a longer 
stem and fewer (3) accessible bases in the loop. The 
helix P8 of GAL4-DTA pro-ribozyme 2 more closely 
resembles that of the CAT-LacZ pro-ribozyme 2, with a 
larger loop (14 bases) containing sequences 
complementary to the "anti-sense" region. Transcripts 
of the GAL4-DTA pro-ribozymes are more stable than 
those of the unmodified ribozyme. In particular, pro- 
ribozyme 2 is mainly intact after incubation in 
conditions that result in essentially complete self- 
cleavage of the ribozyme form (30' § 50°C, 10 mM MgCl 2/ 
2 mM GTP, see Figure 18) . 

Having now fully described the invention, it will 
be understood by those with skill in the art that the 
scope may be performed within a wide and equivalent 
range of conditions, parameters and the like, without 
affecting the spirit or scope of the invention or any 
embodiment thereof. 



WO 92/13089 



- 56 - 



PCT/US92/00277 



SEQUENCE LISTING 
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Goodman, Howard M. 
Brand, Andrea 
Perrimon, Norbert 
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(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Goldstein, Jorge A 
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(2) INFORMATION FOR SEQ ID NO :1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT 

CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT 

CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 

GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 

GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT 

GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT 

GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA 

TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG 

CAACACTGGA GCC6CTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA 

GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA 

CGCGCATGAA TGGATTA 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

GGCCAAGCTT CTTTACGATG CCATTGGGAT ATATCAACCG TGGTATAAAC 
CCGTGGTTTT TAAAAGTTAT CAGGCATGCA CC 

(2) INFORMATION FOR SEQ IQ N0:3: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 47 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GATTAGTTTT GGAGTACTCG TACGGATTCA CGGCCGTCGT TTTACAA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GGCCGAATTC TTACAATTTC CATTCAGGCT GCGCAACTGT TGG 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGAGACCGG AAGCTTCTTT ACGATGCCAT TGGGATATAT CAACGGTGGT 
ATAAAGCCGT GGTTTTTAAA AGTTATCAGG CATGCACCTG GTAGCTAGTC 
TTTAAACCAA TAGATTGCAT CGGTTTAAAA GGCAA6ACCG TCAAATTGCG 
66AAAGGGGT CAACAGCCGT TCAGTACCAA GTCTCAGGGG AAACTTTGAG 
ATGGCCTTGC AAAGGGTATG GTAATAAGCT GACGGACATG GTCCTAACCA 
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CGCAGCCAAG TCCTAAGTCA ACAGATCTTC TGTTGATATG GATGCAGTTC 300 

ACAGACTAAA TGTCGGTCGG GGAAGATGTA TTCTTCTCAT AAGATATAGT 350 

CGGACCTCTC CTTAATGG6A GCTAGCGGAT GAAGTGATGC AACACTGGAG 400 

CCGCTGGGAA CTAATTTGTA TGCGAAAGTA TATTGATTAG TTTTGGAGTA 450 

CTCGTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC 500 

OCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 550 

CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC 600 

GCAGCCTGAA TGGAAATTGT AAG 623 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCG AGTGGCTTCA 60 

AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 120 

AAGGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 180 

GGAAACTTTG AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAAGATG TATTCTTCTC ATAAGATATA GTCGGACCTC TCCTTAATGG 360 

GAGCTAGCGG ATGAAGTGAT GCAACACTGG AGCCGCTGGG AACTAATTTG TATGCGAAAG 420 

TATATTGATT AGTTTTGGAG TACTCGTCTC GATGATGTTG TTGATTCTTC TAAATCTTTT 480 

GTGATTGAAA ACTTTTCTTC GTACCACGGG ACTAAACCTG GTTATGTAGA TTCCATTCAA 540 
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AAAGGTATAC 


AAAAGCCAAA 


ATCTGGTACA 


CAAGGAAATT ATGACGATGA TTGGAAAGGG 


600 


TTTTATAGTA 


CCGACAATAA 


ATACGACGCT 


GCGGGATACT CTGTAGATAA TGAAAACCCG 


660 


CTCTCTGGAA 


AAGCTGGAGG 


CGTGGTCAAA 


GTGACGTATC CAGGACTGAC GAAGGTTCTC 


720 


GCACTAAAAG 


TGGATAATGC 


CGAAACTATT 


AAGAAAGAGT TAGGTTTAAG TCTCACTGAA 


760 


CCGTTGATGG 


AGCAAGTCGG 


AACGGAAGAG 


TTTATCAAAA 6GTTCGGTGA TGGTGCTTCG 


640 


CGTGTAGTGC 


TCAGCCTTCC 


CTTCGCT6AG 


GGGAGTTCTA GCGTTGAATA TATTAATAAC 


900 


TGGGAACAGG 


CGAAAGCGTT 


AAGCGTAGAA 


CTTGAGATTA ATTTTGAAAC CCGTGGAAAA 


960 


C6TGGCCAAG 


ATGCGATGTA 


TGAGTATATG 


GCTCAAGCCT GTGCAGGAAA TCGTGTCAG6 


1020 


C6ATCTTTGT 


GACTCGAG 






1038 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
GTTTAGTTGT TCACCTGAGT CGTGTGTTTT GTATTTTGCG TCTTAGTGTG 
CCTATGGACA AATCTGGATC TCCCAATGCT AGTAGAACCT CCCGGCGTCG 
TCGCCCGCGT AGAGGTTCTC GGTCCGCTTC TGGT 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
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GTTTAGTTGT TCACCTGAGT CGTGTTTTCT TTGTTTTGCG TCTCAGTGTG 
CCTATGGACA AATCTGGATC TCCCAAT6CT AGTAGAACCT CCCGGCGTCG 
TCGCCCGCGT AGAGGTTCTC GGTCCGCTTC TGGT 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GTTATT6TCT ACTGACTATA TAGA6AGTGT TTGTGCTGTG TTTTCTCTTT 
TGTGTCGTAG AATTGAGTCG AGTCATGGAC AAATCTGAAT CAACCAGTGC 
TGGTCGTAAC CGTCGACGTC GTCCGCGTCG TG6TTCCCGC TCCGCCCCCT 
CC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGCTGTG TTTTCTCTTT 
TGTGTCGTAG AATTGAGTCG AGTCATGGAT AAATCTGAAT CAACCAGTGC 
TGGTCGTAAC CGTCGACGTC GTCCGCGTCG TG6TTCCCGC TCCGCCTCCT : 
CC 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 bases 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AGAGAGTGTG TGTGCTGTGT TTTCTCTTTT GTGTCGTAGA ATTGAGTCGA 
GTCATGGACA AATCT6AATC AACCA6TGCT G6TCGTAACC GTCGACGTC6 
TCC6CGTCGT GGTTCCCGCT CCGCCCCCTC C 



(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTTATTGTCT ACTGATTGTA TAAAGAGTGT GTGTGTGCTG TGTTTTCTCT 
TTTACGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
6CTGGTCGCA ACCGTCGACG TCGTCCGCGT CGTGGTTCCC GCTCCGCCCC 
CTCC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



WO 92/13089 



- 63 - 



(ii) MOLECULAR TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGTGCTG TGTTTTCTCT 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGTCGTA ACCGTCGACG TCGTTTGCGT CGTGGTTCCC GCTCCGCCTC 
CTCC 

(2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAGTGTGTAT GTGCTGTGTT TTCTCTTTTG TGTCGTAGAA TTGAGTCGAG 
TCAT6GACAA ATCTGAATCA ACCAGTGCTG GTCGTAACCG TCGACGTCGT 
CCGCGTCGTG GTTCCCGCTC CGCCCCCTCC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
(ii) MOLECULAR TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

GTTATTGTCT ACTGACTATA TAGAGAGTGT GTGTGCTGTG TTTTCTCTTT 

TGTGTCGTA6 AATTGAGTCG AGTCATGGAC AAATCTGAAT CAACCAGTGC 
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TGGTCGTAAC CATC6AC6TC GTCCGCGTCG TGGTTCCCGC TCCGCCCCCT 160 

152 

CC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGAGGGGGCG 6AGCGGGAAC CACGACGCGG ACGACGTCGA CGGTTACGAC 
CAGCCCTGGT AGATTCAGAT TTGTCCAT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17: 
TTTGCGTCTT AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
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TTTGCGTCTC AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ATAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ii) MOLECULAR TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 
TTTACGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 



GCTGGT 

(2) INFORMATION FOR SEQ ID N0:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: . 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 



GCTGGT 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



WO 92/13089 - 67 - 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 

GCTGGT 

(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

( ii ) MOLECULAR TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC TGAATCAACC 

AGTGCTGCA 



(2) INFORMATION FOR SEQ ID N0:27: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:27: 
GCACTGGTTG ATTCAGATTT GTCCATGACT CGACTCAATT CTACGACACA 



(2) INFORMATION FOR SEQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC T6AATCAACC 

AGTGCT6CA 



(2) INFORMATION FOR SEQ ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



AGCATTGGTA TCATCAGGTT TGT 
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• (2) INFORMATION FOR SEQ ID N0:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 
GTTGAT6ATG TTGTTGATTC T 21 



(2) INFORMATION FOR SEQ ID N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 



MET ASP LYS PHE ASP ASP VAL VAL ASP SER 
5 10 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 



ATGGACAAAT TTGATGATGT TGTTGATTCT 



30 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 

AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC TGAATCAACC 50 

59 

AGTGCTGCA 



(2) INFORMATION FOR SEQ ID N0:34: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 
AGCCATCCTT GGTTCAG 



(2) INFORMATION FOR SEQ ID N0:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:35: 
GTAAGGGTGG ATGTT 



17 



15 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:36: 



MET ASP LYS SER GLU LEU ARG VAL ASP VAL 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:37: 
ATGGACAAAT CTGAATTAAG GGTGGATGTT 30 

(2) INFORMATION FOR SEQ ID N0:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: 
TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATG GAAAACTTTT 50 
CTTCGTACCA CGGGACTAAA 70 



WO 92/13089 

• * 



- 72 - 



PCT/US92/00277 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 a™* 00 acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

MET 6LU ASN PHE SER SER TYR HIS GLY THR LYS 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATT GAAAACTTTT 

CTTCGTACCA CGGGACTAAA 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGTTG GAAAACTTTT 



50 
70 



50 
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CTTCGTACCA CGGGACTAAA 70 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:42: 

ATGAAGCTTC TCGATGATGT TGTTGATTCT TCTAAATCTT TTGTGATGGA 50 
AAACTTTTCT TCGTACCACG GGACTAAA 78 



(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:43: 



MET LYS LEU LEU ASP ASP VAL VAL ASP SER SER LYS SER PHE VAL 
15 10 15 

MET GLU ASN PHE SER SER TYR HIS GLY THR LYS 
20 25 



(2) INFORMATION FOR SEQ ID N0:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 
ATGGAGAAAA AAATCACTGG ATATACCACC GTTGATATAT C 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:45: 

MET GLU LYS LYS ILE THR ASP SER LEU ALA VAL VAL LEU GLN ARG 
1 5 10 15 

ARG ASP 
17 

(2) INFORMATION FOR SEQ ID N0:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:46: 

ATGGAGAAAA AAATTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGTGA 

C 

(2) INFORMATION FOR SEQ ID N0:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 40 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



41 



50 
51 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ATGAAGCTAC T6TCTTCTAT CGAACAAGCA TGCGATATTT 40 



(2) INFORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48: 

MET LYS LEU LEU ASP ASP VAL VAL ASP SER SER LYS SER PHE VAL 
1 5 10 15 

MET GLU ASN PHE SER 
20 



(2) INFORMATION FOR SEQ ID N0:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 
ATGAAGCTTC TCGATGATGT TGTTGATTCT TCTAAATCTT TTGTGAT6GA 50 



AAACTTTTCT 



60 



(2) INFORMATION FOR SEQ ID N0:50: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 72 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANOEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:50: 
AUGGAGAAAA AAAUCACUGG AUAUACCACC GUUGAUAUAU CCCAAUGGCA UCGUAAAGAA 60 

72 

CAUUUUGAGG CA 

(2) INFORMATION FOR SEQ ID N0:51 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51: 
AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 
A6UUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU C6GUUUAAAA 
GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 
AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 
CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 
UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUC6 6ACCUCUCCU UAAUGGGAGC 
UAGCGGAUGA AGUGAUGCAA CACUGGAGCC GCUGGGAACU AAUUUGUAUG C6AAAGUAUA 
UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCCGU CCUGUUACAA CGUCGUGAC 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 



60 
120 
180 
240 
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360 
420 
479 
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(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:52: 

AA6CUUCUUU ACGAUGCCAU UG6GAUAUAU CAACGGUGGU AUAAAGCCGU 6GUUUUUAAA 60 

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120 

GGCAAGACCG UCAAAUUGCG G6AAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240 

CGCAGCCAAG UCCUAA6UCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300 

UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUCG GACCUCUCCU UAAUGGGAGC 360 

UAGCGGAUGA AGUGAUGCAA CACUGGAGCC GCUGGGAACU AAUUUGUAUG CGAAAGUAUA 420 

UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCCGU CCUGUUACAA CGUCSUGAC 479 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60 

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120 

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240 

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300 
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360 
420 
480 



60 
120 



UCGGUCGGGA CCGUUGAUAU AUGGUUCAUA ACAUAUAGUC GGACCUCUCC UUAAUGGGAG 
CUAGCG6AU6 AAGUGAUGCA ACACUGGAGC CGCUGGGAAC UAAUUUGUAU GCGAAAGUAU 
AUUGAUUAGU UUUGGAGUAC UCGUACGGAU UCACUGGCCG UCCUGUUACA ACGUCGUGAC 

(2) INFORMATION FOR SEQ ID N0:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:54: 
AAGCUUCU'JU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 
AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 
GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 180 
AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACG6ACAUG GUCCUAACCA 240 
CGCAGCCAA6 UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 
UCGGUCGGGA CCGUUGAUAU AUCCCAAACG GUUCAUAACA UAUAGUCGGA CCUCUCCUUA 
AUGGGAGCUA GCGGAUGAAG UGAUGCAACA CUG6AGCCGC UGGGAACUAA UUUGUAUGCG 
AAAGUAUAUU GAUUAGUUUU GGAGUACUCG UACGGAUUCA CUGGCCGUCC UGUUACAACG 
UCGUGAC 

(2) INFORMATION FOR SEQ ID N0:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GTCGACCTTT TTAAGTCGGC AAATATCGCA T6TTT6TTCG ATAQACATC6 A6TG6CTTCA 60 

AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 120 

AAGGCAAGAC CGTCAAATTG C6GGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 180 

GGAAACTTT6 AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA A6TCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAACAAC ATGCGATATT GTTCTCATAA GATATAGTCG 6ACCTCTCCT 360 

TAAT6GGAGC TAGCGGATGA AGTGATGCAA CACTGGAGCC GCTGGGAACT AATTTGTATG 420 

CGAAAGTATA TTGATTA6TT TTGGAGTACT CGTCTCGATG ATGTTGTTGA TTCTTCTAAA 480 

TCTTTTGTGA TTGAAAACTT TTCTTCGTAC CACGGGACTA AACCTGGTTA TGTAGATTCC 540 

ATTCAAAAAG GTATACAAAA GCCAAAATCT GGTACACAAG GAAATTATGA CGATGATTGG 600 

AAAGGGTTTT ATAGTACCGA CAATAAATAC GACGCTGCGG GATACTCTGT AGATAATGAA 660 

AACCCGCTCT CTGGAAAAGC TGGAGGCGTG GTCAAAGTGA CGTATCCAGG ACTGACGAAG 720 

GTTCTCGCAC TAAAAGTGGA TAATGCCGAA ACTATTAAGA AAGAGTTAGG TTTAAGTCTC 780 

ACTGAACCGT TGATGGAGCA AGTCGGAACG GAAGAGTTTA TCAAAAGGTT CG6TGATGGT 840 

GCTTCGCGTG TAGTGCTCAG CCTTCCCTTC GCTGAGGGGA GTTCTAGCGT TGAATATATT 900 

AATAACTGGG AACAGGC6AA AGCGTTAAGC GTAGAACTTG AGATTAATTT TGAAACCCGT 960 

GGAAAACGTG GCCAAGATGC GATGTATGAG TATATGGCTC AAGCCTGTGC AGGAAATCGT 1020 

GTCAGGCGAT CTTTGTGACT CGAG 1044 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



GTCGACCTTT TTAAGTCGGC AAATATCGCA 


T6TTTGTTCG ATAGACATCG AGT6GCTTCA 


60 


AAAGTTATCA GGCATGCACC TGGTAGCTA6 


TCTTTAAACC AATAGATTGC ATCGGTTTAA 


120 


AAGGCAAGAC CGTCAAATTG CGGGAAAGGG 


GTCAACAGCC GTTCAGTACC AAGTCTCAGG 


180 


GGAAACTTTG A6ATGGCCTT GCAAAG6GTA 


TGGTAATAAG CT6ACGGACA TGGTCCTAAC 


240 


CACGCAGCCA AGTCCTAAGT CAACAGATCT 


TCTGTTGATA TGGATGCAGT TCACAGACTA 


300 


AATGTCGGTC 6GGCAAACAT GCGATATTTG 


CCGTTTGTCA TAAGATATAG TCGGACCTCT 


360 


CCTTAAT6GG AGCTAGCGGA T6AAGTGATG 


CAACACTGGA GCCGCTGGGA ACTAATTTGT 


420 


ATGCGAAAGT ATATTGATTA GTTTTGGAGT 


ACTCGTCTCG ATGATGTTGT TGATTCTTCT 


480 


AAATCTTTTG TGATTGAAAA CTTTTCTTCG 


TACCACGGGA CTAAACCTGG TTATGTAGAT 


540 


TCCATTCAAA AAGGTATACA AAAGCCAAAA 


TCT6GTACAC AAGGAAATTA TGACGATGAT 


600 


TG6AAAGGGT TTTATAGTAC CGACAATAAA 


TACGACGCTG CGGGATACTC TGTAGATAAT 


660 


GAAAACCCGC TCTCTGGAAA AGCTGGAGGC 


6TGGTCAAAG T6ACGTATCC AGGACTGAC6 


720 


AAGGTTCTCG CACTAAAAGT GGATAAT6CC 


6AAACTATTA AGAAAGAGTT AGGTTTAAGT 


780 


CTCACTGAAC CGTTGATGGA GCAAGTCGGA 


ACGGAAGAGT TTATCAAAAG 6TTCGGTGAT 


840 


GGTGCTTCGC GTGTAGTGCT CAGCCTTCCC 


TTCGCTGAGG GGAGTTCTAG C6TTGAATAT 


900 


ATTAATAACT GGGAACAGGC GAAA6CGTTA 


AGCGTAGAAC TTGAGATTAA TTTTGAAACC 


960 


CGTGGAAAAC GTGGCCAAGA TGCGATGTAT 


GAGTATATGG CTCAA6CCTG TGCAGGAAAT 


1020 


CGTGTCAGGC GATCTTTGTG ACTC6AG 
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What is claimed is: 

1. A polynucleotide molecule, said molecule 
encoding a trans-splicing ribozyme, the sequence of 
said ribozyme being a fusion RNA, the sequence of such 
fusion RNA comprising: 

(1) a first RNA sequence, said first RNA 
sequence being sufficient for targeting said 
ribozyme to hybridize to a target RNA, and 
(2) a second RNA sequence, said second RNA 
sequence being capable of being transferred 
colinearly into a target RNA as a result of 
the trans-splicing activity of said 
ribozyme; 

wherein expression of said polynucleotide molecule is 
operably linked to expression of a transcriptional 
activator protein, and wherein said first RNA sequence 
is a sequence that hybridizes to an RNA that encodes 
said transcriptional activator protein. 

2. The polynucleotide molecule of claim 1, 
wherein said transcriptional activator is GAL4 . 

3. The polynucleotide molecule of claim 1, 
wherein said second RNA sequence comprises a sequence 

25 that encodes a peptide toxic to the host cell. 

4. The polynucleotide molecule of claim 3, 
wherein said peptide is the DTA peptide. 

30 5 # The polynucleotide molecule of claim 4, 

wherein said DTA peptide is a mutant peptide sequence. 

6. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino 
35 acids encoded by SEQ ID. No. 40. 



10 



WO 92/13089 



PCT/US92/00277 



- 82 - 



7. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino 
acids encoded by SEQ ID. No. 41. 

8. The polynucleotide molecule of claim 1, 
wherein said first RNA sequence is a sequence that 
hybridizes to GAL4 RNA and wherein said second RNA 
sequence is a sequence that encodes the DTA peptide. 

9. The polynucleotide molecule of any one of 
claims 1-8, wherein said molecule is RNA. 

10. The polynucleotide molecule of any one of 
claims 1-8, wherein said molecule is DNA. 

11. A polynucleotide molecule comprising a 
ribozyme expression cassette, said cassette being 
capable of being stably inserted into the genome of a 
host, and said cassette comprising the sequence of a 
promoter capable of functioning in such host, operably- 
linked to the coding sequence of the polynucleotide of 
any one of claims 1-9. 

12. A host cell comprising the polynucleotide 
molecule of claim 11. 

13. The host cell of claim 12, wherein said host 
cell is a viral cell. 

14. The host cell of claim 12, wherein said host 
cell is a prokaryotic cell. 

15. The host cell of claim 12, wherein said host 
cell is a eukaryotic cell. 
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16. The host cell of claim 15 , wherein said 
eukaryotic cell is a plant cell. 

17. The host cell of claim 15, wherein said 
eukaryotic cell is an animal cell. 

18* The host cell of claim 17, wherein said 
animal is Drosophila. 

19. The host cell of claim 17, wherein said 
animal is a mammal. 

20. The. host cell of claim 19, wherein said 
animal is human. 

21. A method for in vitro trans-splicing, such 
method comprising the steps of: 

(1) providing the polynucleotide molecule of 
claim 9 in a trans-splicing reaction mixture, said 
polynucleotide comprising a sequence capable of 
hybridizing with a second polynucleotide; 

(2) providing said second polynucleotide to such 
reaction mixture; and 

(4) catalyzing the trans-splicing of said second 
polynucleotide under said conditions. 

22. A method for in vivo trans-splicing, said 
method comprising the steps of: 

(1) providing a polynucleotide of claim 9 to a 
host cell; 

(2) expressing said ribozyme encoded by said 
molecule in said host cell; 

(3) expressing a substrate of said ribozyme in 
said host cell; and 

(4) catalyzing the trans-splicing of said ribozyme 
with such substrate in said host cell. 
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23. A method for inactivating the activity of a 
target RNA, said method comprising: 

(1) providing the polynucleotide of claim 9 to a 
trans-splicing reaction mixture, said ribozyme 
possessing catalytic activity against a target RNA, 
said catalytic activity resulting in the inactivation 
of the functioning of said target RNA; 

(2) providing said target RNA to said mixture; and 

(3) providing conditions that allow said 
polynucleotide to express said catalytic activity. 

24. A method for providing a desired genetic 
sequence to a host cell in vivo, said method 
comprising: 

(1) providing the polynucleotide of claim 9 to 
said host cell, said polynucleotide possessing 
catalytic activity against a target RNA in said host 
cell, said ribozyme being capable of trans-splicing 
said desired genetic sequence; 

(2) providing said target RNA in said host cell; 

and 

(3) providing conditions that allow said ribozyme 
to trans-splice said desired genetic sequence into the 
sequence of said target RNA. 

25. A method for cell ablation in multicellular 
plants and animals, said method comprising providing 
the polynucleotide of claim 9 to a fertilized embryonic 
host cell, said ribozyme encoding the sequence of a 
peptide toxic to said host cell and said ribozyme being 
capable of trans-splicing said sequence into a target 
in said host cell. 

26. A method for engineering male or female 
sterility in a plant, said method comprising providing 
the polynucleotide of claim 9 to a germ cell of said 
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species, said ribozyme being targeted to an RNA that, 
when expressed as a protein, is necessary for the 
fertility of said plant and results in ablation of the 
cell that expresses said protein. 

27. A method for immunizing plants against a 
plant pathogen, said method comprising transforming 
plant cells with the polynucleotide of claim 9, wherein 
said polynucleotide encodes a trans-splicing sequence 
capable of providing immunity against said pathogen to 
said plant, and wherein infection of a cell from said 
plant with said pathogen results in the ablation of 
said cell. 

28. The polynucleotide molecule of any one of 
claims 1-7 wherein said ribozyme is a pro-ribozyme. 

29. The polynucleotide molecule of any one of 
claims 8 wherein said ribozyme is a pro-ribozyme. 

30. The polynucleotide molecule of any one of 
claims 9 wherein said ribozyme is a pro-ribozyme. 

31. The polynucleotide molecule of any one of 
claims 10 wherein said ribozyme is a pro-ribozyme. 

32. The polynucleotide molecule of claim 11, 
wherein said ribozyme is a pro-ribozyme. 

33. The host cell claim 12, wherein said ribozyme 
is a pro-ribozyme. 

34. The method of claim 21, wherein said ribozyme 
is a pro-ribozyme. 
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35. The method of claim 22, wherein said ribozyme 
is a pro-ribozyme. 

36. The method of claim 23, wherein said ribozyme 
is a pro-ribozyme. 

37. The method of claim 24, wherein said ribozyme 
is a pro-ribozyme. 

38. The method of claim 25, wherein said ribozyme 
is a pro-ribozyme. 

39. The method of claim 26, wherein said ribozyme 
is a pro-ribozyme. 

40. .The method of claim 27, wherein said ribozyme 
is a pro-ribozyme. 
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! Self-splicing of Group I introns. 



1. Guanosine-mediated cleavage of 5' splice site. 
6© 




mmmm 



3* exon 



2. Cleavage of 3' splice site. 




3' exon 



3. Ligation of exon segments. 



5* exon 



3* exon 



4. Circularization of intron. 



G v ':-^ r t^ 




WO 92/13089 



2/20 



PCT/US92/00277 



o 

a. 
■ 

C/) 

c 

CO 



0) 



N 
O 
£2 



c 
o 



< 

cc 

O) 

"EL 

co 



o 



£ I 



a> 



§■ 

Q> 



< P 
O O — < 

V— < 



O « 



a> 



CO 

C 

1 
."3 



o 
u 
o 
o 

o 

6 

- M 

O 
.O 

u 



1 



1 



2 
Z 
Z 

z y> 
z — c 

Z — ■ C 

z — a* 



z — c # ^ 
z — e 

Z — 5 

z — e 
z — c" 
z — 
z — c r 
z — c 
e : — z 
c — z 



z 
z 
z 



o 

£ — Z i-1 

■ a* 

C — Z 
G*— Z 

z — a^— z 

z — c— z 

« * 

35 — * ,55 



55— C 



9 

u 
0 
o 



>1 

t4 

O 



WO 92/13089 



3/2 0 



PCT/US92/00277 



7> 



=> — < 



>> 

N 

O 

Si 

1- 

o 
c 

o 

Q. 

CO 
I 

c/> 
c 
cu 

o 

T3 

Q> 
Q. 

N 
O 
CO 

-J 

< 



c 
o 



< 

DC 
C 

o 

(0 
» 

to 



o 

c 

o> 

0) 
Q 



i 

S- 

! 

g 



O 



< 



O — rn H 



O 
r-l 



H O— O 

- % o— o 



0 

U 

s 



H 
O 



E 
>* 

Nl 

o 

ra 
o 

CO 
<0 

c 

CO 
CO 

-J 

8 



— CM 



3.0 

cnz> 
(0 O 

CO O 



2< 

9 => 
H O 

>o 

HO 
« D 
> O 

<3 O 

nj O 

3 O 

HO 

o 5 



;s o 

3 D 

H < 

» < 
>»2 



3C5 

) ca 



o 



00 



WO 92/13089 



4/2 0 



PCT/US92/00277 



•II DKA Ctcl4«r 
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7-1C aCAT-Lecl *t*-Ri -> List 



*A sequence • <23 b.p. CCCXGACCCGfcA 



I! 



t 1° Jj I 20 | 30 

1 CCSACA C CC C AACCT7CTT7 AOGATGCCAT 
€1 GCT7TTTAAA AGJ^TATCACG CATCCACCTC 
121 CCCCTUUA CCCAACAOCC TCAAATTGCC 
181 C7C7CASCCC M^CTTTCAC MCCCCTTGC 
241 CTCCXAACCA CCCACOCAAC 3CCZAASTC* 
301 AGA6ACTAAA lcVcGGTCGC CGAAGATCXA 
Ml CTJAAWOGA CCJADCCGAX CWWICAJCC 
421 TQCCAAASTA SATTCA3XA6 TTTTCCACTA 
461 KACCTCCtGA CTpOlAWAC CCTOCCCTX* 
541 CTRG0OCAC ClfWCCIAAI AGCCAA&ACC 
€01 CCACOCXCAX 5CCAAATTCT AA£ 



I 10 



20 



I 30 



GGAAATTCTAAG 1 ine tr 



I 40 
TCGCATATAT 
CTACCTACTC 



ACfGATCTXC 
TTCTTCTCAI 



ClCulXCCCA 
Op^AACTTAA 
CCQQCACCCA 

i 1 40 



I so 

CAACG0ICCT 
TTTAAACCAA 
CAACAQOCCT 
CXAATAACCT 
TCTTCATATC 
AACASAIACT 

TTCACTCCCC 
TOGCCTTCCA 
TCOOCCtTCC 

1 50 



t 60 
AXAAACCCC7 €0 
ZACA7TCCA? 120 
TCACXACCAA 180 
CACCCACATC 240 
CATG&GTTC 300 
C CC MCTCH 360 
CtAAOTCTA 420 
CTCCXTXZAC 4t0 
OCACA3CCCC 540 
CAACACTTCC COO 
C23 
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nospc wus $NA 4 trans-splicing ribozymes. 



Virus RNA target sequences. I 

CMV isolates, jj , 

ii • I 
UDUGCGUCU.. ..... .UAGOGUGCCU AO<j GAC 

WKKJCGOCDCAG.., . . . , .OGOGCCU A]2£ GAC 

uuugugucguagaauugagucgaguc ins GAC 

WUGOGUCOTAGAAt^GAGOCGXCTOC A2S GAP 
WOGtfGOCG^AGMOWSAOTCGAGTJC iflfi GAC 
TODACGUCG^GAAUUGAGUCGAGUC IBS GAC 
VUDGUCUCGUAGAAUU6AGUCGAGUC 2XZS GAC 
PPPGU SDCGflAGAAUUGAGUCGA^ ^jg 
WOGOGDCGUAGAAUUGAGUCGAG^ AkS GAC 

Oligonucleotide target sequence's. 

^^'W'GTGlCGraGAAirTGAGTCGAGTC *ifi GAC 

nimijiumniiiiuiiii in m 

r *^«gcatctiXactcagctcag tac CTG 
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AAA U.CD 
AAA U.CD 
AAA U.CD 
AAA CCD 
AAA U.CD 
AAA U.CD 
AAA U.CU 
AAA V.CU 
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GGA U.CU 
GGA U.CD 
GAA U.CA 
GAA U.CA 
GAA U.CA 
GAA U.CA 
GAA U.CA 
GAA U.CA 
GAA U.CA 
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CCC AAU 
CCC AAU 
ACC AGU 
ACC AGU 
ACC AGU 
ACC AGU 
ACC AGU 
ACC AGU 
ACC AGU 



GCO AGU 
GCU AGU 
GCU GGU 
GCU GGU 
GCU GGU 
GCU GGU — 
GCU GGU— 
GCU GGU - 
GCU GGU - 
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diphtheria toxin ;A-chain trans-splicing ribozymes. 
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» .UGUUUG GACUACOAUGGUUACGA ... 5 » 
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RIBOZXME CORE G.UOGAUGAUGUUGUDGAUUCU 3' 
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met asp lys phe asp asp val val asp ser 
AUG GAC AAA UUU GAD GAU GUU GUU GAU UCU 
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! : II III* III III (46 bp) 

!i : '• GACUUG GUUCCUACC6A ... S' 

■': : I "1111*111 

RIBOZYME CORE G . UAAGGGUGGAUGUU . 3' 



%3 
35- 



met asp lys ser gl« leu arg val asp val 
AUG GAC AAA UCU GAA UUA AGG GUG GAU GUU 
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T**aC*J<-D? t*R2 KDTAUT Liit 

DKA sequ«ac« • 1036 b.p. C7CCACCTTTT1 

' " II « .20 f 30 

I CTCCXCCXIT TTAACTCCCC AAATATCGGA 
61 AAACTXATCA CCCATCCXCC TGCtACCXAC 
«J AACGCAAOC OCTCAAXTTC CGCGAAACGC 

m ccwAcrnc acatcccctt ccmacccta 

241 CACGCACCCA AC70CTAAC? CAACA£A?CT 
301 MTCTOCCTC CCOGMCA^ TATTCTTCTC 
361 CAGCTACCCC AXGAASX*LT GGAAGAClt* 
421 XATATTGWT A CffHUlA C TACtCClCTC 
401 CTGATTGAAA ACJTTICXTC CtAOCACOGC 
511 AAAGGXAXAC AAMGCGAAA ATCTOCXACA 
401 STTTAXAGtA COCACAATAA AtACGACCCT 
461 CJCTCICCAA AACCTt«(5C CGTCCSGAAA 
721 COCm XAQ lcfaUEAAICC CGAAACTATX 
711 C067X6KJC6 AfifMClCCC AAOQG&ACAC 
«1 CCiCLACTCC ICjWCCTlbC CTTCCC3CAC 
Kl T S CCAACA C0 CGAAAGCGTt AAGCGXACAA 
561 CCTGGCCAAC AlfCGATCTA TCACtAXATC 
1021 CCATCTTOT GACTCGAC 
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TTCTCACTCCAC lii*«*r 
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I 40 

TCTTTCTTCC 
TCTTXAAACC 
CT9AACACOC 
TGGTAAXAAC 

AftjkACAZASA 
AOOOOCTOOC 
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ACfAAACCXC 
CAAC SAAATT 
OO^OQAZACT 
CTGAOCXA2C 
AAQAAACMT 
TTSATCAAAA 
OQCA C1IC1A 
CTTGAOAT2A 
OCTCAAGCCT 
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AXAGACATCC 
AAXACAXXGC 
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CTCA CC CACA 
TCGAJQCACT 
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CTXAlCtAGA 
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CTCIAGAZAA 
CAOGACTCAC 
XAGGXTtAAC 
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ACTCOCTTCA 
AXOGCXXTAA 
AACTCTCA CC 
TGCTOCXAAC 
TCACAC ACTA 
TCCX1AASCC 
ZA3G0SAAAC 
tAA AK ir n 
S30CASTCAA 
TIm«AAAfiGG 
tCAAAAOOOG 
CAACCnCIC 
TCKAC3GAA 



OCTT09CTCA MOIUU1HJU 



I 20 



f 30 



1 

1 I 



40 



OOCXSCAATA 
AICTTGAAAC 

GUjUIGGAAA 

I. 50 



XAIXAASAAC 
CCCTGCAAAA 

loncicAcc 

I 60 



£0 

120 

ISO 

240 

300 

340 

420 

490 

540 

€00 

460 

120 

190 

•40 

$00 

540 

1020 

103E 



WO 92/13089 



9/2 0 



PCI7US92/00277 



GaMgene 
frATA^^^^aass^^ 



P-element mediated insertion 
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tissue-specific expression, of Gal4. protein 
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transcriptional activation of gene X 



upsteam activator sequences (UAS) 



P-element mediated "enhancer-trapping" for expression of 
Ga!4 protein. , 
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QAL4 VECTOR (pGaTB or pGaTN) 
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(Unique sites in italics 5 ; 3' Notl site is unique in pGaTB only) 
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"Pro-ribozyme" design. 

(1). CAT-LecZ a-peptide trans-splicing ribozyme 

t i 
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Modification o* helix P8 for pro-ribozjme construction. 
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